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TRANSLATIONAL PROFILING 

CROSS REFERENCE TO RELATED APPLICATIONS 
This application claims priority from U.S. Provisional Application No. 
60/279,495, filed March 28, 2001, U.S. Provisional Application No. 60/292,544, filed 
May 21, 2001, U.S. Provisional AppHcation No. 60/310,801, filed August 8, 2001, U.S. 
Provisional Application No. 60/326,370, filed October 1, 2001, U.S. Provisional 
Application No. 60/336,780, filed December 4, 2001, and U.S. Provisional Application 
No. 60/358,985, filed February 20, 2002. These applications are incorporated herein by 
reference in their entirety. 

FIELD OF THE INVENTION 
The invention relates to peptides identified by translational profiling methods, as 
well as nucleic acids encoding the peptides, methods of using the peptides to characterize 
the protein composition of a cell, and methods of using the peptides to diagnose, prevent, 
and treat disease. 

REFERENCE TO SEOUENCE USTING SUBMITTED ON A COMPACT DISC 
This application includes a compact disc (four copies of disc submitted) 
20 containing a sequence listing. The sequence listing is identified on the compact disc as 
follows. 



File Name 


Date of Creation 


Size (bytes) 


08191-026WOl.TXT 


March 25, 2002 


8,015,000 



The entire content of the sequence listing is herein incorporated by reference. 

25 

BACKGROUND OF THE INVENTION 
Essentially every cell within an organism contains the complete and identical 
genetic information of that organism, but expresses only a subset of that total 
complement of genes. For example, the human genonie, which is composed of a total of 
30 three billion nucleotides, is currently thought to include approximately 30,000-40,000 

1 
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genes. However, individual cells expresses only about 2,000 to about 4,000 different 
proteins, corresponding to only 10% of the total number of genes. It is the concerted 
activity of the proteins expressed in a given cell that orchestrates the activities that define 
a particular cell type at a given developmental, metabolic or disease stage. 
5 In the past decades it has become clear that the development and the pathology of 

many diseases involves differences in gene expression. Indeed, healthy and diseased 
tissue or cell types can frequently be distinguished by differences in gene expression. For 
example, normal cells may evolve to highly invasive and metastatic cancer cells by 
activation of certain growth-inducing genes, e,g,, oncogenes, or the inactivation of certain 

10 growth-inhibitory genes, tumor suppressors or apoptosis activators. Levine, 1997, 
Cell 88:323; Hunter, 1997, Cell 88:333; Jacobson, 1997, Cell 88:347; Nagata, 1997, Cell 
88:355; Fraser etal., 1996, Cell 85:781. Altered expression of such genes, e,g., growth 
activators or growth suppressors, in turn affects expression of other genes. See, The \ 
National Cancer fiistitute, "The Nation's Investment In Cancer Research: A Budget 

15 Proposal For Fiscal Years 1997/98", Prepared by the Director, National Cancer Institute, 
pp. 55-77. 

Pathological gene expression differences are not confined to cancer. 
Autoimmune disorders, many neurodegenerative diseases, inflammatory diseases, 
restenosis, atherosclerosis, many metabolic diseases, and numerous other disorders are 

20 believed to involve aberrant expression of particular genes. Naparstek et al.y 1993, Ann, 
Rev. Immunol 11:79; Sercarz etaL, 1993, Ann. Rev, Immunol 11:729. As a 
consequence, a challenge in medical research is to understand the role each gene or its 
encoded protein plays in maintaining normal cellular homeostasis and to utilize this 
heightened understanding in improving the ability to treat disease and/or identify 

25 predispositions to disease at stages when treatment and/or prevention methods are 
available. 

Significant resources have been expended to identify and isolate genes relevant to 
disease development One approach has been to sequence and catalogue all the 
individual genes contained in the genome of a species. In the case of humans, the NIH 
30 initiated the Humane Genome Project in 1990, with the goal to sequence the entire 

human genome by the year 2005. Stephens et al, 1990, Science 250:237; Cantor, 1990, 
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Science 248:49-51 . The near complete sequence of the human genome was published in 
advance of the 2005 target date. Venter et al., Science 2001 291:1304; International 
Human Genome Sequencing Consortium Nature 2001 409:860. However, the vast 
amount of information made available by the sequencing of the human genome is 
5 insufficient to resolve the mysteries of many disease processes because cellular function 
and dysfunction results from the concerted interaction and differential expression of 
proteins. Indeed, nucleotide sequence information alone does not indicate when, where, 
and how much of a given gene is expressed at the protein level. 

10 SUMMARY OF THE INVEimON 

The present invention is based on the purification of a series of peptide sequences 
derived from proteins produced within a panel of cells. The purification and sequencing 
of these peptides demonstrates both the existence of a given protein as well as the 
production of the given protein in a particular cell type. In many cases, the existence of a 

15 given protein was uncertain prior to the characterization describe herein, as it had never 
previously been isolated or even detected. Members of one class of peptides described 
herein, temned expressed protein tags (EPTs), bind to and are presented by human MHC 
class I or class n molecules. Members of a second class of peptides are chemically or 
enzymatically prepared from complex protein mixtures. 

20 The invention generally relates to novel peptides and proteins containing the 

novel amino acid sequences. In addition, the invention relates to nucleic acids encoding 
polypeptides containing the novel peptides, methods of using the peptide sequences in the 
context of a database or a peptide profile to characterize the protein composition of a cell 
or a peptide array comprising peptides of the invention, and using the identified peptides 

25 and corresponding nucleic acids in methods of treatment, diagnosis, and screening. 

In one aspect, the invention features a purified polypeptide including a peptide 
sequence selected from the group consisting of SEQ ID NOs:l-235. In an embodiment, the 
polypeptide comprises at least 8 contiguous amino acids of an amino add sequence selected 
from the group consisting of SEQ ID NOs: 1-235. In another embodiment, the invention 

30 features a purified immunogenic polypeptide comprising at least 8 contiguous amino acids 
of an amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235. 
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"Immunogenic peptides" are peptides that result in or enhance an immune response in a 
mammal. Examples of immunogenic peptides can be found, for example in U.S. 5,827,516 
and U.S. 6,183,746. Jn another embodiment, the invention features a purified polypeptide, 
comprising at least an immunogenic portion of a protein, wherein the protein comprises an 
5 amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235. 

In another aspect, the invention features a purified polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 1-235, wherein the 
purified polypeptide comprises at least 25 amino acids. In an example, the purified 
polypeptide comprises fewer than 100 amino acids. In another example, the purified 
10 polypeptide comprises fewer than 50 amino acids. 

In one embodiment, the polypeptide consists of a peptide sequence selected fiiom the 
group consisting of SEQ ID NOs: 1-235. In another embodiment, the polypeptide consists 
essentially of an amino acid sequence selected from the group consisting of SEQ ID NOs: 1- 
235. 

15 The peptide sequence can be identical to that of a naturally processed class I MHC- 

binding peptide. Alternatively, the peptide sequence can be identical to that of a naturally 
processed class II MHC-binding peptide. 

In another aspect, the invention features an isolated nucleic acid encoding a 
polypeptide comprising a peptide sequence selected from the group consisting of SEQ ID 

20 NOs: 1-235. In an embodiment, the polypeptide comprises an amino acid sequence which is 
at least 95% identical to an amino acid selected from the group consisting of SEQ ID NOs: 
1-235. In another embodiment, the isolated nucleic acid comprises a nucleotide sequence 
that encodes a polypeptide comprising an amino add sequence selected from the group 
consisting of a variant of any one of SEQ ID NOs: 1-235, wherein the variant has no more 

25 than two conservative amino acid substitutions. In a further embodiment, the isolated 

nucleic acid comprises a nucleotide sequence that encodes a polypeptide comprising at least 
8 contiguous amino acids of an amino add sequence selected from the group consisting of 
SEQ ID NOs: 1-235. 

In some examples the encoded polypeptide includes a peptide sequence identical to 
30 that of a naturally processed class I MHC-binding peptide. Alternatively, the peptide 
sequence can be identical to that of a naturally processed class II MHC-binding peptide. 



4 



wo 02/078524 



PCT/US02/09671 



In one embodiment, an isolated nucleic add encodes a polypeptide including a 
peptide sequence identical to a segment of a naturally occuning protein, wherein the peptide 
sequence is selected from the group consisting of SEQ E) NOs: 1-235, and wherein the 
polypeptide does not include more than 10, 15, 20, 30, 40, 50, 60, 70 , 80 , 90, or 100 
5 consecutive amino acids identical to a portion of the naturally occurring protein. The 
peptide sequence can be identical to that of a naturally processed class I MHC-binding 
peptide. Alternatively, the peptide sequence can be identical to that of a naturally processed 
class U MHC-binding peptide. 

In an aspect, the invention features an isolated nucleic acid comprising a nucleotide 

10 sequence encoding a polypeptide consisting of an amino add sequence selected from the 
group consisting of SEQ ID NOs: 1-235. In another aspect, the isolated nucleic acid 
comprises a nucleotide sequence encoding a polypeptide consisting essentially of an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 1-235. 

The invention also includes an expression vector containing a nucleic acid described 

15 herein. In an example, the vector comprises expression control sequences that direct 

expression of the polypeptide. In another example, the vector comprises expression control 
sequences that direct expression of the nucleic add molecule. Also included in the 
invention is a cell containing an expression vector of the invention. 

In another aspect, the invention features an antibody specific for a polypeptide of 

20 the invention, e.g., a peptide sequence selected from the group consisting of SEQ ID NOs: 
1-235. In an example, the antibody selectively binds to the polypeptide which is expressed 
on a cell surface. In another example, the antibody of the polypeptide is a target of a second 
antibody located on a cell surface. 

In another aspect, the invention features a humanized antibody which specifically 

25 binds to a domain of a polypeptide comprising an amino add sequence selected fom the 
group consisting of SEQ ID NOs: 1-235 or an isolated nucleic acid which encodes the 
antibody. In preferred embodiments, the humanized antibody is a full length antibody, a 
human IgG, an antibody fragment and a F(ab)2. The invention also features a humanized 
antibody as described herein bound to a detectable label. In another aspect, the invention 

30 features an immobiUzed antibody comprising a humanized antibody as described herein 
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bound to a solid phase. In a further aspect, the invention features a conjugate comprising a 
humanized antibody as described herein bound to a cytotoxic agent. 

The invention also includes a method for determining the presence of a protein 
comprising exposing a sample suspected of containing the protein to a humanized antibody 
5 as described herein and determining binding of the antibody to the sample. In another 
aspect, the invention includes a kit comprising a humanized antibody as described herein 
and instructions for using the humanized antibody to detect a protein that binds to the 
antibody. 

The invention also includes a method of making an antibody, the method 

10 comprising: (a) providing a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NOs: 1-235 or a nucleic acid encoding such a polypeptide to 
a mammal in an amoimt effective to induce the production of an antibody that binds to the 
polypeptide; (b) isolating from the mammal a cell that produces an antibody that selectively 
binds to a polypeptide consisting of an amino add sequence selected from the group 

15 consisting of SEQ ID NOs: 1-235; (c) immortalizing the cell isolated in step (b); and (d) 
isolating antibodies from the immortalized cell. 

The invention also includes a method of modulating the activity of a polypeptide 
described herein, the method including contacting the polypeptide with a compound that 
binds to the polypeptide in a concentration sufficient to modulate the activity of the 

20 polypeptide. In an example, the compound that binds the polypeptide is an antibody that 
selectively binds a polypeptide consisting of an amino acid sequence selected for the 
group consisting of SEQ ID NOs: 1-235. 

In another aspect, the invention features a metiiod of treating a disorder in a 
mammal, the method including: (1) identifying a mammal with the disorder, and 

25 (2) administering to the mammal a compound that modulates the expression or activity of 
a polypeptide described herein, wherein the administration results in an amelioration of 
one or more symptoms of the disorder. The disorder can be for example a cellular 
proliferative and/or differentiative disorder or a disorder associated with the particular 
biological class of proteins to which the polypeptide belongs. 

30 In another aspect, the invention features a method for detecting the presence of a 

polypeptide described herein in a sample, die method including: (1) contacting the 
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sample with a compound that selectively binds to the polypeptide; and (2) detemiining 
whether the compound binds to the polypeptide in the sample. 

In another aspect, the invention features a method for detecting the presence of a 
disorder in a manomal, the method including: (1) providing a biological sample derived 
5 from the mammal; (2) contacting the sample with a compound that binds to a polypeptide 
described herein or to a nucleic acid that encodes such a polypeptide; and (3) determining 
whether the compound binds to the sample, wherein binding of the compound to the 
sample indicates the presence or absence of the disorder in the mammal. 

In another aspect, the invention features a method for imaging a site in a manmial, 

10 the method including: (1) administering a compound to a mammal, wherein the 

compound binds to a polypeptide described herein (or to a nucleic acid that encodes such 
a polypeptide) at the site in the mammal; and (2) detecting the compound with an 
imaging detector, to thereby image the site in the mammal. 

In another aspect, the invention features a method for identifying a compound that 

15 modulates the activity of a polypeptide described herein, the method including: 

(1) contacting a polypeptide described herein with a test compound; and (2) determining 
the effect of the test compound on the activity of the polypeptide, to thereby identify a 
compound that modulates the activity of the polypeptide. 

In another aspect, the invention features a method for identifying a compound that 

20 modulates the expression of a nucleic acid described herein, the method including: (1) 
contacting the nucleic acid with a test compound; and (2) determining the effect of the 
test compound on the expression of the nucleic acid, to thereby identify a compound that 
modulates the expression of the nucleic acid. 

In another aspect, the invention features a peptide profile that is characteristic for 

25 a given cell, wherein the profile includes a representation of at least ten different 

polypeptides in the cell, wherein each of the at least ten different polypeptides contains a 
peptide selected from the group consisting of SEQ ID NOs: 1-235, and wherein the 
peptide profile is a reproducible characteristic of the cell. In one example, the each of the 
at least ten different polypeptides contains an MHC-binding peptide. In one example, the 

30 representation characterizes each individual peptide based upon at least one physical or 
chemical attribute, the at least one physical or chemical attribute including amino acid 
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sequence. In addition, the representation can characterize each individual peptide based 
upon at least two physical or chemical attributes, e.g., wherein one of the physical or 
chemical attributes is amino acid sequence. For example, one of the physical or chemical 
attributes can be mass-to-charge ratio or ion-fragmentation pattern. In another example, the 
5 representation can characterize each individual peptide based upon at least three physical or 
chemical attributes. In another aspect, the invention features a polypeptide profile that is 
characteristic of a selected cell under selected conditions, wherein the profile comprises a 
representation of at least ten different polypeptides expressed by the cell, wherein each of 
the at least ten different polypeptides comprises an amino acid sequence selected ftom the 

10 group consisting of SEQ ID NOs: 1-235, and wherein the polypeptide profile is a 
reproducible characteristic of the cell. 

In another aspect, the invention features a database, stored on a machine-readable 
medium, containing: two categories of data respectively representing (a) peptide profiles 
and (b) cell sources; and associations among instances of the two categories of data, wherein 

15 the data representing peptide profiles include a peptide profile described herein, and wherein 
the database configures a computer to enable finding instances of data of one of the 
categories based on their associations with instances of data the other category. 

In another aspect, the invention features a database, stored on a machine-readable 
medium, comprising: (a) three categories of data respectively representing (i) 

20 polypeptides, (ii) cell sources, and (iii) ceD treatments; and (b) associations among 

instances of the three categories of data, wherein the data representing peptides comprises 
at least 100 polypeptides each having an amino acid sequence selected fit)m the group 
consisting of SEQ ID NOs: 1-235, and wherein the database configures a computer to 
enable finding instances of data of one of the categories based on their associations with 

25 instances of data of at least one other category. 

. In another aspect, the invention features a peptide array comprising at least 100 
peptides selected fi-om the group consisting of peptides consisting of an amino acid 
sequence selected from the group consisting of SEQ ID NOs: 1-235, each peptide linked 
to a solid support at a known location. In another aspect the invention features a 

30 collection of at least 10 polypeptide arrays, each array comprising at least 100 
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polypeptides consisting of an amino acid sequence selected from the group consisting of 
SEQ ID NOs: 1-235, each peptide linked to a solid support at a known location. 

In another aspect, the invention features a method of selecting an antibody, the 
method including; (1) contacting a polypeptide described herein with an in vitro library 
5 of antibodies; (2) binding an antibody to the polypeptide; and (3) selecting the antibody 
that binds to the polypeptide. 

In anoth^ aspect, the invention features an immunogenic composition comprising a 
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ 
E) NOs: 1-235, the composition when injected into a mammal elicits an immunogenic 
10 response directed against a polypeptide consisting of an amdno acid sequence selected from 
the group consisting of SEQ ID NOs: 1-235. 

The invention also features a method for treating a cancer comprising 
administering to a patient in need of such treatment an amount of a composition 
comprising a polypeptide comprising an amino acid sequence selected from the group 
15 consisting of SEQ ID NOs: 1-235 in an amount sufficient to elicit an immunogenic 
response. 

The invention also features a method for treating a cancer patient, tiie method 
comprising administering to the patient an antibody that selectively binds to a peptide 
consisting of an amino acid sequence selected from the group consisting of SEQ ID 
20 NOs:l-235, 

In another aspect, die invention features a method for identifying a compound that 
binds to a naturally processed class I or class IE MHC-binding polypeptide, the method 
comprising exposing a test compound to a collection of at least 100 polypeptides selected 
from the group consisting of polypeptides having an amino acid sequence selected from 
25 the group consisting of SEQ ID NOs: 1-235, and identifying a peptide to which the test 
compound binds. 

An "isolated" or "purified" polypeptide, protein, or peptide (these terms are used 
interchangeably) is a polypeptide, protein, or peptide that is separated from those 
components (proteins and otiier naturally-occurring organic molecules) tiiat naturally 
30 accompany it. Typically, the polypeptide, protein, or peptide is substantially pure when it 
constitutes at least 60%, by weight, of the protein in the preparation. Preferably, the 
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protein in the preparation consists of at least 75%, more preferably at least 90%, and most 
preferably at least 99%, by weight, of the polypeptide, protein, or peptide of the 
invention. 

An "isolated" or "purified" nucleic acid refers to a nucleic acid that is separated 
5 from other nucleic acid molecules present in the natural source of the nucleic acid. With 
regards to genomic DNA, the temi "isolated" refers to a nucleic acid molecule that is free 
of sequences that naturally flank the nucleic acid (i.e., sequences located at the 5' and/or 
3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated nucleic acid molecule 

10 can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 5' and/or 3' 
nucleotide sequences that naturally flank the nucleic acid molecule in genomic DNA of 
the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 
molecule, such as a cDNA molecule, can be substantially free of other cellular material, 
or culture medium when produced by recombinant techniques, or substantially free of 

15 chemical precursors or other chemicals when chemically synthesized. 

The term "nucleic acid" includes, for example, a recombinant DNA that is 
incorporated into a vector such as an autonomously replicating plasmid or virus. The 
nucleic acids herein can comprise ribonucleotides, deoxyribonucleotides, or modified 
forms of either nucleotide. Isolated nucleic acid sequences can be single or double 

20 stranded and can be polynucleotides or oligonucleotides. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Suitable methods and materials are described below, although 
methods and materials similar or equivalent to those described herein can also be used in 

25 the practice or testing of the present invention. All publications, patent applications, 
patents, and other references mentioned herein are incorporated by reference in their 
entirety. In case of conflict, the present specification, including definitions, will control. 
In addition, the materials, methods, and examples are illustrative only and not intended to 
be limiting. 

30 Other features and advantages of the invention will be apparent fix)m the 

following detailed description, and from the claims. 

10 
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DETATT.KD DESCRIPTION 
The present invention relates generally to peptide sequences identified by 
translational profiling methods. The invention also relates to polypeptides containing the 
5 peptide sequences, nucleic acids encoding polypeptides containing the peptide sequences, 
the use of these compositions in methods and systems for analyzing the protein 
composition of cells and cell populations, and methods of using the compositions in the 
diagnosis and treatment of disease as well as in the screening for therapeutic compounds 
to treat disease. 

10 

Polypeptides and Nucleic Acids 

The invention features purified polypeptides comprising a peptide sequence of 
any of SEQ ID NOs: 1-235. Polypeptides can be purified from cells or tissue sources 
using a variety of protein purification techniques. 

15 Methods of obtaining a purified preparation of a recombinant protein are well 

known in the art and include culturing transformed host cells under culture conditions 
suitable to express the protein, and purifying the resulting protein using known 
purification processes, such as gel filtration or ion exchange chromatography. The 
purification of the protein may also utilize an affinity colunm containing agents which 

20 will bind to the protein; one or more column steps over affinity resins such as 

concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 3GA Sepharose®; one or 
more steps involving hydrophobic interaction chromatography using such resins as 
phenyl ether, butyl ether, or propyl ether, and/or immunoaffinity chromatography. 

Additionally, one or more reverse-phase high performance liquid chromatography 

25 (RP-HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having 

pendant methyl or other aliphatic groups, can be employed to further purify the protein. 
Some or all of the foregoing purification steps, in various combinations, can also be 
employed to provide a substantially homogenous isolated protein. - The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 

30 with the present invention as a "purified polypeptide." 
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A polypeptide can also be isolated from cells or tissue sources by using an affinity 
molecule to separate the polypeptide from a complex mixture of proteins. For example, a 
polypeptide can be purified by isolating a molecule, e.g., an MHC class I or class H 
molecule, to which the polypeptide is bound and eluting the polypeptide from the 
5 molecule. Alternatively, a polypeptide can be isolated from cells or tissue sources by 
using an anti-polypeptide antibody, e.g., an antibody described herein. Polypeptides or 
fragments thereof can also be synthesized chemically, e.g., by solid phase methods using 
an automated peptide synthesizer. Polypeptides can also be isolated and fragmented in 
vitro by the action of chemical or enzymatic treatments. 

10 The amino acid sequences of the peptides of SEQ ID NOs: 1-235 are presented in 

Table 1 (see Examples). This table indicates the "source protein symbols" from which 
each of the peptides is derived. Symbols are obtained from three places in the following 
order (a) gene symbol(s) and alias(es) from Locus link; (b) gene name(s) from 
LocusUnk; or (c) Locus titles from LocusLink. The table also provides SEQ ID NOs for 

15 each of the source proteins. The sequences corresponding to the SEQ ID NOs of these 
source proteins were obtained from GenBank™ accession numbers. The accession 
numbers can be viewed by entering (under a 'TProtein" search) the sequence for the 
"source protein reference" at www.ncbi.nhn.nih.gov/PubMed/. The entire content of 
each of this references is herein incorporated by reference. Many of the respective 

20 GenBank™ accessions also provide a reference to a nucleic acid sequence encoding the 
source protein. These nucleic acid sequences are also incorporated by reference in their 
entirety. 

In some embodiments, the polypeptide does not include more than 200 
consecutive amino acids, e.g., no more than 150, 100, 90, 80, 70, 60, 50, 40, or 30 amino 

25 acids, identical to a portion of a naturally occurring protein from which a peptide of SEQ 
ID NOs: 1-235 is derived. In other embodiments, the polypeptide consists of a peptide of 
any of SEQ ID NOs: 1-235, or a variant peptide as described below. In other 
embodiments, the polypeptide comprises at least 8 contiguous amino acids of an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 1-235. In another 

30 embodiment, the purified polypeptide comprises an amino acid sequence selected from 
the group consisting of SEQ ID NOs: 1-235, wherein the purified polypeptide comprises 

12 
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at least 25 amino acids. In other embodiments, the purijfied polypeptide comprises fewer 
than 100 or 50 amino acids. 

In another embodiment, the purified polypeptide consists of an amino acid 
sequence selected from the group consisting of SEQ ID NOs: 1-235. In another 
5 embodiment, the purified polypeptide consists essentially of an amino acid sequence 
selected from the group consisting of SEQ BD NOs: 1-235. 

Other embodiments include a polypeptide that contains one or more changes in 
amino acid sequence, e.g., a change in an amino acid residue that is not essential for 
activity, e.g., the ability of the polypeptide to bind to a MHC molecule or to be 

10 recognized by an antibody described herein. Such polypeptides differ in amino acid 
sequence from SEQ ID NOs: 1-235, yet retain biological activity. In one embodiment, 
the polypeptide includes an amino acid sequence at least about 80%, 85%, 90%, 95%, 
98% or more identical to any of SEQ ID NOs: 1-235. In another embodiment the 
polypeptide comprises an amino acid sequence selected from the group consisting of a 

15 variant of any one of SEQ ID NOs: 1-235, wherein the variant has no more than two 
conservative amino acid substitutions. In another embodiment, the polypeptide 
comprises at least an immunogenic portion of a protein, wherein the protein comprises an 
amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235. 

The amino acid residues at particular positions in a polypeptide may include 

20 analogs, derivatives and congeners of any specific amino acid referred to herein. For 

example, the present invention contemplates the use of amino acid analogs wherein a side 
chain is lengthened or shortened while still providing a carboxyl, amino or other reactive 
precursor functional group for cyclization, as well as amino acid analogs having variant 
side chains with appropriate functional groups. For instance, the subject polypeptide can 

25 include an amino acid analog such as P-cyanoalanine, canavanine, djenkolic acid, 

norleucine, 3-phosphoserine, homoserine, dihydroxyphenylalanine, 5-hydroxytryptophan, 
1-methylhistidine, or 3-methylhistidine. Other naturally occurring anodno acid 
metabolites or precursors having side chains that are suitable herein will be recognized by 
those skilled in the art and are included in die scope of the present invention. Analogs of 

30 polypeptides can be generated by mutagenesis, such as by discrete point mutation(s), or 
by truncation. For instance, mutation can give rise to analogs that retain substantially the 
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same, or merely a subset, of the biological activity of the polypeptide from which it was 
derived. 

The polypeptides that can be utilized in the present invention also include analogs 
that are resistant to proteolytic cleavage such as those that, due to mutations, alter 
5 ubiquitination or other enzymatic targeting associated with the protein. 

Polypeptide analogs may also be chemically modified to create derivatives by 
forming covalent or aggregate conjugates with other chemical moieties, such as gjycosyl 
groups, lipids, phosphate, acetyl groups and the hke. Covalent derivatives of proteins can 
be prepared by linking the chemical moieties to functional groups on amino acid 

10 sidechains of the protein or at the N-terminus or at the C-terminus of the polypeptide. 

Modification of the structure of the subject polypeptides can be for such purposes 
as enhancing stability (e.g., ex vivo shelf life and resistance to proteolytic degradation in 
vivo), or post-translational modifications (e.g., to alter the phosphorylation pattern of the 
polypeptide). Such modified peptides, when designed to retain at least one activity of a 

1 5 naturally-occurring form of the polypeptides disclosed herein, are considered to be their 
functional equivalents. Such modified peptides can be produced, for instance, by amino 
acid substitution, deletion, or addition. 

For example, it is reasonable to expect that an isolated replacement of a leucine 
with an isoleudne or valine, an aspartate with a glutamate, a threonine with a serine, or a 

20 similar replacement of an amino acid with a structurally related amino acid (i.e. isosteric 
and/or isoelfectric mutations) will not have a major effect on the biological activity of the 
resulting molecule. Thus, altered nucleic acid sequences encoding polypeptides which 
are encompassed by the invention include deletions, insertions, or substitutions of 
different nucleotides resulting in a polynucleotide that encodes the same or a functionally 

25 equivalent polypeptide. The encoded protein may also contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a 
functionally equivalent polypeptide. Deliberate amino acid substitutions may be made on 
the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, 
and/or the amphipathic nature of the residues as long as the biological activity of tiie 

30 polypeptide is retained. Conservative replacements are those that take place within a 
family of amino acids that are related in their side chains. Genetically encoded amino 
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acids are can be divided into four families: (1) acidic=aspartate, glutamate; (2) 
basic=Iysine, arginine, histidine; (3) nonpolar^alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan; and (4) uncharged polap=glycine, 
asparagine, glutamine, cysteine, serine, threonine, tyrosine (see, e.g.. Biochemistry, 2nd 
5 ed., Ed. by L. Stryer, W H Freeman and Co.: 1981). Whether a change in the amino acid 
sequence of a peptide results in a functional analog (e.g., functional in the sense that the 
resulting polypeptide mimics the wild-type form) can be readily determined by assessing 
the ability of the variant peptide to produce a response in cells in a fashion similar to the 
wild-type protein. Polypeptides in which more than one replacement has taken place can 
10 readily be tested in the same manner. 

As set forth above, alterations in primary sequence include genetic variations, 
both natural and induced. Also included are analogs that include residues other than 
naturally occurring L-amino acids, e.g., D-amino acids or non-naturally occurring or 
. synthetic amino acids, e.g., P or 5 amino acids. Alternatively, increased stability or 
15 . solubility may be conferred by cyclizing the peptide molecule, 

A polypeptide of the invention preferably does not contain a peptide sequence 
described in Tables 1-10 of U.S. Patent No. 5,827,516. 

The invention also features purified nucleic acids comprising nucleotides 
encoding polypeptides comprising amino acid sequences selected from the group 
20 consisting of SEQ ID NOs: 1-235 or an amino acid sequence which is at least 95% 

identical to an amino acid sequence selected from the group consisting of SEQ ID NOs: 1- 
235. In another embodiment, the isolated nucleic acid comprises a nucleotide sequence 
that encodes a polypeptide comprising an amino acid sequence selected firom the group 
consisting of a variant of any one of SEQ ID NOs: 1-235, wherein tiie variant has no more 
25 than two conservative amino acid substitutions. 

In another embodiment, the isolated nucleic acid comprises a nucleotide sequence 
encoding a polypeptide consisting of an amino add sequence selected from the group 
consisting of SEQ ID NOs: 1-235. In another embodiment, the isolated nucleic acid 
comprises a nucleotide sequence that encodes a polypeptide comprising at least 8 
30 contiguous amino acids of an amino acid sequence selected from the group consisting of 
SEQ ID NOs: 1-235. In another embodiment, the isolated nucleic acid comprises a 
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nucleotide sequence encoding a polypeptide consisting essentially of an amino add 
sequence selected from the group consisting of SEQ ID NOs:l-235, 

In a further enabodiment, the isolated nucleic acid comprises a nucleotide 
sequence encoding a polypeptide comprising no more than 30 contiguous amino acids of 
5 a naturally occurring human protein, wherein the naturally occurring protein comprises 
an amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235. 

A nucleic acid encoding a polypeptide described herein can be cloned into an 
expression vector, e.g., a vector in which the coding sequence is operably linked to 
expression control sequences. The need for, and identity of, expression control 

10 sequences will vary according to the type of cell in which the DNA is to be expressed. 
Generally, expression control sequences can include any or all of the following: a 
transcriptional promoter, enhancer, suitable mRNA ribosomal binding sites, translation 
start site, and sequences that terminate transcription and translation, including 
polyadenylation and possibly translational control sequences. Suitable expression control 

15 sequences can be selected by one of ordinary skill in the art. In one example, the vector 
comprises an expression control sequence that directs the expression of the polypeptides 
described herein. In another example, the vector comprises expression control sequences 
that direct expression of the nucleic acid molecule, as described herein. The nucleic acids 
encoding the polypeptides described herein may encode a methionine residue at the 

20 amino terminus of the polypeptide to facilitate translation. Standard methods can be used 
by the skilled person to construct expression vectors. See generally, Sambrook et al., 
1989, Molecular Cloning: A Laboratory Manual (2nd Edition), Cold Spring Harbor 
Press, N.Y. 

Vectors useful in this invention include linear DNA with transcriptional control 
25 elements, RNA, plasmid vectors, viral vectors, and bacterial vectors. A "plasmid" is an 
autonomous, self-replicating, extrachromosomal, circular DNA. Preferred viral vectors 
are those derived firom retroviruses, adenovirus, adeno-associated virus, pox vhnses, 
S V40 virus, alpha viruses or herpes vimses. 

Isolated nucleic acids can be used for the in vitro production of polypeptides of 
30 the invention. For example, a cell or cell line can be transfected, transformed, or infected 
with a nucleic acid described herein. After an incubation period that permits expression 
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of a polypeptide encoded by the nucleic acid, the polypeptide can be purified from the 
cell culture media, if secreted, or from a lysate of the cells expressing the polypeptide. 

Fusion Proteins 

5 The invention also provides fusion proteins. A "fusion protein" refers to a 

polypeptide containing a peptide sequence described herein, e.g., a peptide of any of SEQ 
ID NOs: 1-235, and a heterologous amino acid sequence. A "heterologous amino acid 
sequence" refers to a sequence of contiguous amino acids that is not contained within the 
protein from which the peptide sequence is derived, e.g,, a naturally occurring protein 

10 that contains any of SEQ ID NOs: 1-235. In other words, a fusion protein is not identical 
to a naturally occurring protein because it contains both a peptide sequence described 
herein as well as an ajnino acid sequence not contained within the naturally occurring 
protein from which the peptide sequence is derived. The fusion protein can contain a 
heterologous amino acid sequence fused to the N-terminus and/or C-teraiinus of the 

15 peptide sequence. 

The fusion protein can include a moiety that has a high affinity for a ligand. Such 
fusion proteins, e.g., GST-fusion proteins, can facilitate the purification of recombinant 
polypeptide. Fusion proteins can include all or a part of a serum protein, e.g., an IgG 
constant region, or human serum albumin. 

20 The fusion protein can include a trafficking sequence. A "trafficking sequence" is 

an amino acid sequence that causes a polypeptide to which it is fused to be transported to 
a specific compartment of the cell. An example of a trafficking sequence is a signal 
sequence. In certain host cells (e.g., mammalian host cells), expression and/or secretion 
of a polypeptide can be increased through use of a heterologous signal sequence. For 

25 example a signal sequence can be linked, with or with out a linker, to a polypeptide* 
described herein, e.g., a peptide of any of SEQ ID NOs:l-235. 

Fusion proteins of the invention can be used as immunogens. For example, 
administration of a fusion protein, or a nucleic acid encoding a fusion protein, can be 
used to elicit an immune response in a host, e.g., a mammal such as a mouse, rat, or 

30 human. Thus, the invention features an inmiunogenic composition comprising a 

polypeptide as described herein, the composition when injected into a mammal elicits an 
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immunogenic response directed against a polypeptide as described herein. The 
immunogenic response can be elicited by fragments of the polypeptide or nucleic acids 
encoding fragments of the polypeptide. Such fusion proteins may be useful in the 
development of antibodies, as described below. 

5 

Antibodies 

The invention also includes an antibody, multispecific antibodies (e.g., bispecific 
antibodies), or a fragment thereof (e.g., an antigen-binding fragment thereof) that is 
specific for a peptide sequence described herein, e.g., a peptide of any of SEQ ID NOs:l- 

10 235. The term "antibody" as used herein refers to an immunoglobuhn molecule or 
immunologically active portion thereof, i.e., an antigen-binding portion, including 
heterologous and chimeric antibodies.. The antibody can be a polyclonal or a 
monoclonal antibody. In other embodiments, the antibody can be recombinantly 
produced, e.g., produced by phage display or by combinatorial methods. "Antibody 

15 fragments" comprise a portion of a full length antibody, generally the antigen binding or 
variable region thereof. Examples of antibody fragments include Fab, Fab', F(ab')2. and 
Fv fragments and fragments produced by a Fab expression hbrary; diabodies; linear 
antibodies; single-chain antibody molecules; and multispecific antibodies formed from 
antibody fragments. Neutralizing antibodies, (i.e., those which inhibit dimer formation) 

20 are especially preferred for therapeutic use. Various techniques have been developed for 
the production of antibody fragments. Traditionally, these fragments were derived via 
proteolytic digestion of intact antibodies (see, e.g., Morimoto et al., Journal of 
Biochemical and Biophysical Methods 24: 107-117 (1992) and Brennan et al.. Science 
229: 81 (1985)). However, these fragments can now be produced directly by recombinant 

25 host cells. For example, Fab*-^H fragments can be directly recovered from E. coli and 
chemically coupled to form F(ab')2 fragments (Carter et al., Bio/Technology 10: 163-167 
(1992)). According to another approach, F{Bb% fragments can be isolated directly from 
recombinant host cell culture. F(ab')2 fragments can be produced by pepsin digestion of 
the antibody molecule and the Fab fragments which can be generated by reducing the 

30 disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be 
constructed to allow rapid and easy identification of monoclonal Fab fragments with the 
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desired specificity (Huse, W. D. et al. (1989) Science 254:1275-1281). Other techniques 
for the production of antibody fragments will be apparent to the skilled practitioner. 

It may be desirable to generate multispecific (e.g, bispecific) humanized 
antibodies, as described herein, having binding specificities for at least two different 
5 epitopes. Exemplary bispecific antibodies may bind to two different epitopes of a protein. 
Alternatively, an arm may be combined with an arm which binds to a triggering molecule 
on a leukocyte such as a T-cell receptor molecule (e.g*» CD2 or CDS), or Fc receptors for 
IgG (Fc.garama.R), such as Fc.gamma.RI (CD64), FcyRII (CD32) and Fc.gamma.Rin 
(0016) so as to focus cellular defense mechanisms to the protein expressing cell. 

10 Bispecific antibodies may also be used to localize cytotoxic agents to cells which express 
a protein. These antibodies possess a protein-binding arm and an arm which binds the 
cytotoxic agent (e.g., saporin, anti-interferon-.alpha., vinca alkaloid, ridn A chain, 
methotrexate or radioactive isotope hapten). Bispecific antibodies can be prepared as full 
length antibodies or antibody fragments (e.g., F(ab')2 bispecific antibodies). 

15 According to another approach for making bispecific antibodies, the interface 

between a pair of antibody molecules can be engineered to maximize the percentage of 
heterodimers which are recovered from recombinant cell culture. The preferred interface 
comprises at least a part of the Ch 3 domain of an antibody constant domain. la this 
method, one or more small amino acid side chains from the interface of the first antibody 

20 molecule are replaced with larger side chains (e.g., tyrosine or tryptophan). 

Compensatory "cavities" of identical or similar size to the large side chain(s) are created 
on the interface of the second antibody molecule by replacing large amino acid side 
chains with smaller ones (e.g., alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as 

25 homodimers. See WO96/2701 1 published Sep. 6, 1996. 

Bispecific antibodies include cross-linked or "heteroconjugate" antibodies. For 
example, one of the antibodies in the heteroconjugate can be coupled to avidin, the other 
to biotin. Heteroconjugate antibodies may be made using any convenient cross-Knking 
methods. Suitable cross-linking agents are well known in the art, and are disclosed in 

30 U.S. Pat. No. 4,676,980, along with a number of cross-linking techniques. 
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Techniques for generating bispeciific antibodies from antibody fragments have 
also been described in the literature. For example, bispecific antibodies can be prepared 
using chemical linkage. Brennan et al.. Science 229: 81 (1985) describe a procedure 
wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
5 fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide foimation. The Fab' 
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of 
the Fab -TNB derivatives is then reconverted to the Fab -thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab -TNB 

10 derivative to form the bispecific antibody. The bispecific antibodies produced can be 
used as agents for the selective immobilization of enzymes. 

Fab -SH fragments can be recovered from E. coli, which can be chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med, 175: 217-225 (1992) 
describe the production of a fiilly humanized bispecific antibody F(ab')2 molecule. Each 

15 Fab' fragment was separately secreted from E. coli and subjected to directed chemical 
coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed 
was able to bind to cells overexpressing the HER2 receptor and normal human T cells, as 
well as trigger the lytic activity of human cytotoxic lymphocytes against human breast 
tumor targets 

20 Various techniques for making and isolating bispecific antibody fragments 

directiy from recombinant cell culture have also been described. For example, bispecific 
antibodies have been produced using leucine zippers. Kostelny et al., J. Inmiunol. 148(5): 
1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins were linked 
to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers 

25 were reduced at the hinge region to form monomers and then re-oxidized to form the 
antibody heterodimers. This method can also be utilized for the production of antibody 
homodimers. The "diabody" technology described by Hollinger et al., Proc. Natl. Acad, 
Sci. USA 90: 6444-6448 (1993) has provided an alternative mechanism for making 
bispecific antibody fragments. The fragments comprise a heavy-chain variable domain 

30 (Vh) connected to a light-chain variable domain (Vl) by a linker which is too short to 
allow pairing between the two domains on the same chain. Accordingly, the Vh and Vl 
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domains of one fragment are forced to pair with the complementary Vl and Vh domains 
of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also 
been reported. See Gruber et al., J. Immunol. 152: 5368 (1994). Alternatively, the 
5 bispecific antibody may be a "linear antibody" produced as described in Zapata et al. 
Protein Eng. 8(10): 1057-1062 (1995). 

The term "monoclonal antibody" as used herein refers to an antibody obtained 
from a population of substantially homogeneous antibodies, i.e., the individual antibodies 
comprising the population are identical except for possible naturally occurring mutations 

10 that may be present in minor amounts. Monoclonal antibodies are highly specific, being 
directed against a single antigenic site. Furthermore, in contrast to conventional 
(polyclonal) antibody preparations which typically include different antibodies directed 
against different determinants (epitopes), each monoclonal antibody is directed against a 
single determinant on the antigen. The modifier "monoclonal" indicates the character of 

15 the antibody as being obtained from a substantially homogeneous population of 

antibodies, and is not to be construed as requiring production of the antibody by any 
particular method. For example, the monoclonal antibodies to be used in accordance with 
the present invention may be made by the hybridoma method first described by Kohler et 
al.. Nature 256: 495 (1975), or may be made by recombinant DNA methods (see, e.g., 

20 U.S. Pat. No. 4,816,567). The "monoclonal antibodies" may also be isolated from phage 
antibody libraries using the techniques described in Clackson et al., Nature 352: 624-628 
(1991) and Marks et al., J. Mol. Biol. 222: 581-597 (1991), for example. 

The monoclonal antibodies herein specifically include "chimeric" antibodies 
(immunoglobulins) in which a portion of the heavy and/or light chain is identical with or 

25 homologous to corresponding sequences in antibodies derived fixim a particular species 
or belonging to a particular antibody class or subclass, while the remainder of the 
chain(s) is identical with or homologous to corresponding sequences in antibodies 
derived fix)m another species or belonging to another antibody class or subclass, as well 
as fragments of such antibodies, so long as they exhibit the desired biological activity 

30 (U.S. Pat. No. 4,816,567; and Morrison et al., Proc. Natl. Acad Sci: USA 81 : 6851-6855 
(1984)). 
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Polyclonal antibodies are preferably raised in animals by multiple subcutaneous 
(sc) or intraperitoneal (ip) injections of the relevant antigen and an adjuvant. It may be 
useful to conjugate the relevant antigen to a protein that is immunogenic in the species to 
be immunized, e.g., keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, or 
5 soybean trypsin inhibitor using a bifunctional or derivatizing agent, for example, 
maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine residues), N- 
hydroxysuccinimide (through lysine residues), glutaraldehyde, succinic anhydride, 
SOCI2, or N=C=NR, where R and R^ are different alkyl groups. 

A polypeptide described herein, e.g., a peptide of any of SEQ ID NOs: 1-235, can 
10 be used as an immunogen or can be used to identify antibodies made with other 
immunogens, e.g., cells, membrane preparations, and the like. Polypeptides can be 
expressed on the cell surface enabling the binding of an antibody, as described herein, 
that is specific to the polypeptide. Alternately, an antibody described herein may bind to 
a polypeptide described herein, where the polypeptide is a target of a second antibody 
15 located on the cell surface. 

An antibody (e.g., a monoclonal antibody) can be used to isolate a polypeptide 
described herein by standard techniques, such as affinity chromatography or 
inmiunoprecipitation. Moreover, an antibody can be used to detect the polypeptide (e.g., 
in a cellular lysate or cell supernatant) in order to evaluate the abundance and pattern of 
20 expression of the protein. 

Furthermore, an antibody can be used to target a protein in vivo for a variety of 
purposes including disease screening, diagnosis, and treatment. For example, an 
antibody can be modified to include a toxin and/or a detectable label, as described herein. 
Antibodies coupled to a toxic agent can be particularly useful to target and destroy 
25 diseased or infected ceUs. 

An antibody can be coupled to a toxin, e.g., a polypeptide toxin, e.g., ricin or 
diphtheria toxin or active fi:agment thereof, or a radioactive nucleus, or imaging agent, 
e.g. a radioactive, enzymatic, or other, e.g., imaging agent, e.g., a NMR contrast agent 
Toxins can be optionally in an inactive state and be subject to activation following their 
30 administration to a subject (e.g., activation via radio energy, irradiation with x-rays, or 
other penetrating rays). Labels which produce detectable radioactive emissions or 
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fluorescence are preferred. Examples of detectable substances that can be coupled to an 
antibody include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, biolumdnescent materials, and radioactive materials. 

"Humanized" forms of non-human (e.g., murine) antibodies are chimeric 
5 antibodies which contain minimal sequence derived from non-human immunoglobulin. 
For the most part, humanized antibodies are human immunoglobulins (recipient 
antibody) in which hypervariable region residues of the recipient are replaced by 
hypervariable region residues from a non-human species (donor antibody) such as mouse, 
rat, rabbit or nonhuman primate having the desired specificity, .affinity, and capacity. 

10 The resulting antibody is one in which amino acids have been replaced in the non-antigen 
binding regions in order to more closely resemble a human antibody, while still retaining 
the original binding ability. In some instances, Fv framework region (FR) residues of the 
human inmiunoglobulin are replaced by corresponding non-human residues. 
Furthermore, humanized antibodies may comprise residues which are not found in the 

15 recipient antibody or in the donor antibody. These modifications are made to further 
refine antibody performance. In general, the humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the hypervariable loops correspond to those of a non-human 
immunoglobulin and all or substantially all of the FR regions are those of a human 

20 immunoglobulin sequence. The humanized antibody optionally also will comprise at least 
a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin. For further details, see Jones et al., Nature 321: 522-525 (1986); 
Reichmann et al., Nature 332: 323-329 (1988); and Presta, Curr. Op. Struct Biol. 2: 593- 
596 (1992). 

25 "Single-chain Fv" or "sFv" antibody fragments comprise the Vh and V.l domains 

of antibody, wherein these domains are present in a single polypeptide chain. Generally, 
the Fv polypeptide further comprises a polypeptide linker between the Vh and Vl 
domains which enables the sFv to form the desired structure for antigen binding. For a 
review of sFv see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 113, 

30 Rosenburg and Moore eds. Springer- Verlag, New York, pp. 269-315 (1994). 
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Humanized antibodies can be produced, for example by transgenic non-human 
animals. Such animals are capable of producing heterologous antibodies of multiple 
isotypes. Heterologous antibodies are encoded by immunoglobulin heavy chain genes 
not normally found in the genome of that species of non-human animal. Transgenic non- 

5 human animals (e.g., mammals) can be of a variety of species including murine (rodents 
(e.g., mice, rats), avian (chicken, turkey, fowl), bovine (beef, cow, cattle), ovine Qamb, 
sheep, goats), porcine (pig, swine), and piscine (fish). Transgenic non-human animals 
can be produced by introducing transgenes into the germline of the non-human animal. 
A "transgene" means a nucleic acid sequence (encoding, e.g., a human Fc receptor), 

10 which is partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into 
which it is introduced, or, is homologous to an endogenous gene of the transgenic animal 
or cell into which it is introduced, but which is designed to be inserted, or is inserted, into 
the animal's genome in such a way as to alter the genome of the cell into which it is 
inserted (e.g., it is inserted at a location which differs from that of the natural gene or its 

15 insertion results in a knockout). A transgene can include one or more transcriptional 
regulatory sequences and any other nucleic acid, such as introns, that may be necessary 
for optimal expression of a selected nucleic acid. Methods of producing transgenic 
animals and humanized antibodies are for example described in U.S. patents 5,569,825, 
5,770,429, and 6,11,166. 

20 Humanized antibodies can be bound to labels or be in the form of a conjugate 

bound to a cytotoxic agent. The word "label" when used herein refers to a detectable 
compound or composition which is conjugated directly or indirectly to the antibody. The 
label may itself be detectable by itself (e.g., radioisotope labels or fluorescent labels) or, 
in the case of an enzymatic label, may catalyze chemical alteration of a substrate 

25 compound or composition which is detectable. The term "cytotoxic agent" as used 
herein refers to a substance that inhibits or prevents the function of cells and/or causes 
destruction of cells. The term is intended to include radioactive isotopes (e.g., iP^, I*^, 
and Re^^, chemotherapeutic agents, and toxins such as enzymatically active toxins 
of bacterial, fungal, plant or animal origin, or fragments thereof. A "chemotherapeutic 

30 agent" is a chemical compound useful in the treatment of cancer. Examples of 

chemotherapeutic agents include Adriamycin, Doxorubicin, 5-Huorouracil, Cytosine 



24 



wo 02/078524 



PCT/US02/09671 



arabinoside ("Ara-C"), Cyclophosphamide, Thiotepa, Taxotere (docetaxel), Busulfan, 
Cytoxin, Taxol, Methotrexate, Cisplatin, Melphalan, Vinblastine, Bleomycin, Etoposide, 
Ifosfamide, Mitomycin C, Mitoxantrone, Vincreistine, Vinorelbine, Carboplatin, 
Teniposide, Daunomycin, Carminomycin, Aminopterin, Dactinomycin, Mitomycins, 
5 Esperamicins (see U.S. Pat. No. 4,675,187), Melphalan and other related nitrogen 
mustards. 

Covalent modifications of the humanized antibody are also included within the 
■ scope of this invention. They may be made by chemical synthesis or by enzymatic or 
chemical cleavage of the antibody, if applicable. Other types of covalent modifications of 

10 the antibody are introduced into the molecule by reacting targeted amino acid residues of 
the antibody with an organic derivatizing agent that is capable of reacting with selected 
side chains or the N- or C-terminal residues. 

Cysteinyl residues most commonly are reacted with .alpha, -haloacetates (and 
corresponding amines), such as chloroacetic acid or chloroacetamdde, to give 

15 carboxymethyl or caitoxyaraddomethyl derivatives. Cysteinyl residues also are 
derivatized by reaction with bromotrifluoroacetone, .alpha.-bromo-.beta. -(5- 
imidozoyl)propionic acid, chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl 
disulfide, methyl 2-pyridyl disulfide, p-chloromercuribenzoate, 2-chloromercuri-4- 
nitrophenol, or chloro-7-nitrobenzo-2-oxa-l,3-diazole. 

20 Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5,5- 

7.0 because this agent is relatively specific for the histidyl side chain. Para- 
bromophenacyl bromide also is useful; the reaction is preferably performed in 0.1 M 
sodium cacodylate at pH 6.0. 

Lysinyl and amino-terminal residues are reacted with succinic or other carboxylic 

25 acid anhydrides. Derivatization with these agents has the effect of reversing the charge of 
the lysinyl residues. Other suitable reagents for derivatizing .alpha.-amino-containing 
residues include imidoesters such as methyl picolinimidate, pyridoxal phosphate, 
pyridoxal, chloroborohydride, trinitrobenzenesulfonic acid, 0-methylisourea, 2,4- 
pentanedione, and transaminase-catalyzed reaction with glyoxylate. 

30 Arginyl residues are modified by reaction mth one or several conventional 

reagents, among them phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione, and 

25 
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ninhydrin. Derivatization of arginine residues requires that the reaction be performed in 
alkaline conditions because of the high pKa of the guanidine functional group. 
Furthermore, these reagents may react with the groups of lysine as well as the arginine 
epsilon-amino group. 

5 The specific modification of tyrosyl residues may be made, with particular 

interest in introducing spectral labels into tyrosyl residues by reaction with aromatic 
diazonium compounds or tetranitromethane. Most commonly, N-acetylimidizole and 
tetranitromethane are used to form 0-acetyl tyrosyl species and 3-nitro derivatives, 
respectively. Tyrosyl residues are iodinated using I or I to prepare labeled proteins 

10 for use in radioimmunoassay. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction 
with carbodiimides (R-N=C=N-RO, where R and R' are different alkyl groups, such as 1- 
cyclohexyl-3-(2-morpholinyl-4-ethyl) carbodiimide or l-ethyl-3-(4-azonia-4,4- 
dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are converted 

15 to asparaginyl and glutaminyl residues by reaction with ammonium ions. 

Glutaminyl and asparaginyl residues are frequently deamidated to the 
corresponding glutamyl and aspartyl residues, respectively. These residues are 
deamidated under neutral or basic conditions. The deamidated form of these residues falls 
within the scope of this invention. 

20 Other modifications include hydroxylation of proline and lysine, phosphorylation 

of hydroxyl groups of seryl or threonyl residues, methylation of the .alpha.-amino groups 
of lysine, arginine, and histidine side chains (T. E. Creighton, Proteins: Structure and 
Molecular Properties, W.IL Freeman & Co., San Francisco, pp. 79-86 (1983)), 
acetylation of the N-tenninal amine, and amidatipn of any C-terminal carboxyl group. 

25 Another type of covalent modification involves chemically or enzymatically 

coupling glycosides to the antibody. These procedures are advantageous in that they do 
not require production of the antibody in a host cell that has glycosylation capabilities for 
N- or 0-linked glycosylation. Depending on the coupling mode used, the sugar(s) may be 
attached to (a) arginine and histidine, (b) free carboxyl groups, (c) fijee sulfhydryl groups 

30 such as those of cysteine, (d) free hydroxyl groups such as those of serine, threonine, or 
hydroxyproline, (e) aromatic residues such as those of phenylalanine, tyrosine, or 
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tryptophan, or (f) the amide group of glutamine. These methods are described in 
WO87/05330 published 11 Sep. 1987, and in ApKn and Wriston, CRC Crit. Rev. 
Biochem., pp. 259-306 (1981), 

Removal of any carbohydrate moieties present on the antibody may be 

5 accomplished chemically or enzymatically. Chemical deglycosylation requires exposure 
of the antibody to the compound trifluoromethanesulfonic acid, or an equivalent 
compound. This treatment results in the cleavage of most or all sugars except the Unking 
sugar (N-acetylglucosamine or N-acetylgalactosamine), while leaving the antibody intact. 
Chemical deglycosylation is described by Hakimuddin, et al. Arch. Biochem. Biophys. 

10 259: 52 (1987) and by Edge et al. Anal. Biochem., 118: 131 (1981). Enzymatic cleavage 
of carbohydrate moieties on antibodies can be achieved by the use of a variety of endo- 
and exo-glycosidases as described by Thotakura et al. Meth. Enzymol. 138: 350 (1987). 

Another type of covalent modification of the antibody comprises linking the 
antibody to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, 

15 polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Pat. NOs. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Humanized antibodies can also be immobilized to a solid phase. By "solid phase" 
is meant a non-aqueous matrix to which the antibody of the present invention can adhere. 
Examples of solid phases encompassed herein include those formed partially or entirely 

20 of glass (e.g. controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, 
polystyrene, polyvinyl alcohol and silicones. In certain embodiments, depending on the 
context, the solid phase can comprise the well of an assay plate; in others it is a 
purification column (e.g. an affinity chromatography column). This term also includes a 
discontinuous solid phase of discrete particles, such as those described in U.S. Pat. No. 

25 4,275,149. 

Diagnostic and therapeutic uses for the antibody are contemplated. In one 
diagnostic application, the invention provides a method for determining the presence of a 
protein comprising exposing a sample suspected of containing the protein to the antibody 
and determining binding of the antibody to the sample. For this use, the invention 
30 provides a kit comprising the antibody and instructions for using the antibody to detect 
the protein. 
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Also included in the invention is an isolated nucleic acid, as described herein, 
encoding a humanized antibody, described herein, as well as a vector comprising the 
nucleic acid and a cell comprising the vector. 

Antibodies can be used to modulate the activity of a polypeptide of the invention, 
as described herein. The invention includes a method for modulating the activity of the 
polypeptide of the invention, the method comprising contacting the polypeptide v^th a 
compound that binds to the polypeptide in a concentration sufQcient to modulate the activity 
of the polypeptide. The compound that binds to the polypeptide can be an antibody as 
described herein. 

The invention also features a method of making an antibody, the method comprising 
(a) providing a polypeptide described herein to a manmial in an amount effective to induce 
the production of an antibody that binds to the polypeptide; (b) isolating from the mammal a 
cell that produces an antibody tiiat selectively binds to a polypeptide as described herein; (c) 
immortalizing the cell isolated in step (b); and (d) isolating antibodies from the immortalized 
cell. 

The invention also includes a method of selecting an antibody, the method 
comprising: (a) contacting a polypeptide as described herein with an in vitro library of 
antibodies; (b) binding an antibody to the polypeptide; and (c) selecting the antibody that 
binds to the polypeptide. 

The invention also includes a nucleic acid that encodes an antibody described 
herein. Also included are vectors that include the nucleic acid and cells transformed with 
the nucleic acid, particularly cells which are useful for producing an antibody, e.g., 
manunalian cells, e.g. CHO or lymphatic cells. 

The invention also includes cell lines, e.g., hybridomas, which make an antibody 
described herein, and method of using said cells to make an antibody. 

Also included in the invention are anti-peptide antibodies. An anti-peptide 
antibody is an antibody that binds to the amino acid sequence of a peptide described 
herein, e.g., a peptide of any of SEQ ID NOs: 1-235. In one example, the antibody is 
capable of recognizing the peptide when the peptide is bound to an MHC class I or 
class n molecule. The antibody can recognize either the peptide sequence or a 
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combination of the peptide sequence and an MHC molecule. See, e.g., Apostolopoulos et 
aly 1998, /. Immunol 161:767 for a description of anti-peptide antibodies. 

The anti-peptide antibodies can be used to detect the expression of a protein 
within a cell (e.g., detection of a processed peptide on the cell surface by an anti-peptide 

5 antibody indicates that the protein, e.g., intracellular protein, is expressed within the cell). 
Such an anti-peptide antibody can be particularly useful for determining the protein 
composition of a cell when the cell is subjected to varying conditions or stimuli. 
Additionally, an anti-peptide antibody can be useful for detecting the presence of a 
disease-associated antigen within a cell. For example, a cell can be diagnosed as 

10 containing a cancer-related protein by detecting a peptide described herein presented by 
an MHC molecule on the surface of the cell. Antibodies raised against peptides can also 
be used therapeutically to treat human maladies. For example, such an antibody can be 
modified to contain a reagent, e.g., a toxin, that damages or destroys diseased or infected 
cells to which it binds. 

15 

Gene Discovery 

The human genome has been reported to contain approximately 30,000-40,000 
genes, a number significantly lower than previous estimates of 100,000 or more genes. 
Venter et al., Science 2001 291:1304; Iitemational Human Genome Sequencing 

20 Consortium Nature 2001 409:860. One possible explanation for this discrepancy is that 
computer algorithms used to analyze raw nucleotide sequence and identify genes may not 
have detected a subset of the genes in the human genome. Because the peptides 
described herein correspond to portions of actual proteins actually produced by a cell, the 
compositions and methods of the invention allow for the identification of as yet 

25 unidentified genes. For example, those peptides that do not match to any known genes 
may represent the protein product of a novel gene. 

A peptide sequence described herein can be compared to a predicted translation of 
human genomic sequence (a predicted translation of each strand of genomic DNA, in 
three reading frames), ff this analysis identifies a matching sequence, then a careful 

30 analysis of the reading frame encoding the peptide should allow for identification of the 
remainder of the gene encoding the peptide, including but not limited to coding 
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sequences, 5' and 3' untranslated regions, alternatively spliced exons, introns, promoters, 
enhancers, and silencer or repressor elements. 

In addition to sequence analysis, a gene and/or a cDNA encoding a protein 
containing a peptide described herein can be isolated by methods well known to those of 

5 skill in the art. Isolation of a gene or a cDNA is especially relevant for peptides that lack 
a genomic match, but can also be useful to verify the nucleotide sequence that encodes 
any peptide. The skilled artisan will appreciate that a number of methods are known in the 
. art to identify and isolate genes or cDNAs using amino acid information, and will know how 
to identify and practice such methods. See, for example, Sambrook et a/., 1989 Molecular 

10 Cloning: A Laboratory Manual 2nd ed. Cold Spring Harbor Laboratory Press; Ausubel et 
al . Current Protocols in Molecular Biology, Greene PubHshing Associates and Wiley 
Interscience, N.Y. (current edition). Such methods include the preparation of degenerate 
probes or primers based upon the peptide amino acid sequence and using such primers for 
identification and/or amplification of genes and or cDNAs in appropriate libraries or 

15 other sources of genomic materials. The chromosomal location of the gene encoding the 
protein from which a peptide is derived may be determined, for example, by hybridizing 
appropriately labeled nucleic acids to chromosomes in situ. 

Detection of Protein Expression 

20 The compositions and methods described herein can be used to determine the 

protein composition of a cell. The detection of mRNA within a cell, for example by 
Northern analysis or RT-PCR, does not indicate whether the mRNA is translated, much 
less how much of the corresponding protein is produced in the cell. Detection of a 
peptide described herein indicates that the protein from which it is derived has been 

25 produced by the cell. Thus, the invention includes a method of determining the protein 
composition of a cell (or tissue sample) by detecting the presence of a peptide described 
herein to thereby determine that the cell (or tissue sample) expresses the protein from 
which the peptide is derived. The method can be used to determine the presence of a 
peptide and/or the protein from which it is derived, and optionally the quantity of a 

30 peptide and/or protein produced by a cell. 
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In addition to the "translational verification" described above, the peptides can be 
used to detennine the reading frame that is being used by a gene. For example, the 
detection of an mRNA or a portion of an mRNA does not automatically indicate the 
amno acid sequence of the corresponding protein. The peptides described herein can 
5 thus be used to discover reading frames of genes that are being expressed. 

Protein Classifications 

The peptides described herein belong to a v^de variety of functional biological 
classes. Many of the classes to which particular peptides belong are described in the 
10 Table presented in the Examples. Members of many of these classes of proteins have 
been well-characterized as participating in important biological pathways and/or have 
been implicated in a variety of disease conditions. Several of these classes are described 
in more detail below. 

15 Kinases 

As described in Examples 1 and 2 (and the accompanying table), many of the 
peptides described herein are derived from proteins that appear to be kinases. Kinases 
catalyze the transfer of high energy phosphate groups from a phosphate donor to a 
phosphate acceptor. Nucleotides usually serve as the phosphate donor in these reactions, 

20 with most kinases utilizing adenosine triphosphate (ATP). Reversible protein 

phosphorylation is a primary method for regulating protein activity in eukaryotic cells. In 
general, proteins are activated by phosphorylation in response to extracellular signals 
such as hormones, neurotransmitters, and growth and differentiation factors, llie 
activated proteins initiate the cell's intracellular response by way of intracellular 

25 signaling pathways and second messenger molecules, such as cyclic nucleotides, 
calcium-calmodulin, inositol, and various mitogens, that regulate protein 
phosphorylation. 

Kinases are involved in many aspects of a cell's function, from basic metabolic 
processes such as glycolysis, to cell-cycle regulation, differentiation, and communication 
30 with the extracellular environment through signal transduction cascades. Kinase targets 
include proteins, inositol, lipids, and nucleotides. Inappropriate phosphorylation of 
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proteins in cells has been linked to changes in cell cycle progression and cell 
differentiation. Changes in cell cycle progression have been linked to induction of 
apoptosis or cancer. Changes in cell differentiation have been linked to diseases and 
disorders of the reproductive system, immune system, and skeletal muscle. 
5 Table 1 lists several of the peptides described herein that appear, based upon 

structural homology, to belong to the kinase superfamily. These peptides (and their 
corresponding source proteins) can therefore be used to treat disorders associated with 
inappropriate phosphorylation of kinase targets, e.g., disorders associated with changes in 
cell cycle progression and/or cell differentiation, or to screen for agonists and antagonists 

10 useful for the same purpose. In addition, nucleic acids encoding the proteins as well as 
compounds (e.g., antibodies) that recognize the proteins can be used in a wide variety of 
applications described herein, including therapeutics, diagnostics, and drug screening. 

The invention therefore includes the following peptides as kinases: SEQ ID N0:7, 
SEQ ID N0:12, SEQ ID NO:32, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:45, SEQ 

15 ID NO:85, SEQ ID NO:90, SEQ ID NO:95, SEQ ID N0:118, SEQ ID NO:140, SEQ ID 
N0:181, and SEQ ID NO:185. 

Phosphatases 

As described in Examples 1 and 2 (and the accompanying table), many of the 
20 peptides described herein are derived from proteins that appear to be phosphatases. 

Phosphatases are characterized as tyrosine-specific or serine/threonine-specific based on 
their preferred phospho-amino acid substrate. Some phosphatases exhibit dual specificity 
for both phospho-tyrosine and phospho-serine/threonine residues. 

Serine/threonine phosphatases play important roles in glycogen metabolism, 
25 muscle contraction, protein synthesis, oocyte maturation, and hepatic metabolism. 
(Cohen, P. (1989) Annu. Rev. Biochem. 58:453-508). Tyrosine phosphatases play 
important roles in lymphocyte activation and cell adhesion. In addition, Hit genes 
encoding several tyrosine phosphatases have been mapped to chromosomal regions that 
are translocated or rearranged in various neoplastic conditions, including lymphoma, 
30 leukemia, small cell lung carcinoma, adenocarcinoma, and neuroblastoma (Charbonneau, 
H. and Tonks, N. K. (1992) Annu. Rev. Cell Biol. 8:463-493). Because cellular 
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transformation is often accompanied by increased phosphorylation activity, the regulation 
of phosphorylation activity by phosphatases may therefore be an important strategy for 
controlling some types of cancer. 

Table 1 lists several of the peptides described herein that appear, based upon 
5 structural homology, to belong to the phosphatase superfamily. These peptides (and their 
corresponding source proteins) can therefore be used to treat disorders associated with 
inappropriate phosphorylation and/or phosphatase activity, or to screen for agonists and 
antagonists useful for the same purpose. These protein phosphatases and the nucleic 
acids encoding them allow for the manufacture of new compositions that are useful in the 
10 diagnosis, prevention, and treatment of disorders such as immune system disorders, cell 
proliferative and differentiative disorders (including cancer), and neurological disorders. 

The invention therefore includes the following peptides as phosphatases: SEQ ID 
N0:18, SEQ ID NO:24, SEQ ID NO:76, SEQ ID NO:103, SEQ ID NO:125, SEQ ID 
NO:199, SEQ ID NO:224, and SEQ ID NO:231 

15 

Proteases and Protease Inhibitors 

As described in Examples 1 and 2 (and the accompanying table), many of the 
peptides described herein are derived from proteins that appear to be proteases. Proteases 
cleave proteins and peptides at the peptide bond that forms the backbone of the protein or 

20 peptide chain. Proteolytic processing is an essential component of cell growth, 

differentiation, remodeling, and homeostasis. The cleavage of peptide bonds within cells 
is necessary for the maturation of precursor proteins to their active forms, the removal of 
signal sequences from targeted proteins, the degradation of incorrectly folded proteins, 
and the controlled turnover of peptides within the cell. 

25 Proteases participate in apoptosis (and disorders associated with inappropriate 

levels of apoptosis) as well as tissue remodeling during embryonic development, wound 
healing, and normal growth. Proteases are involved in the etiology or progression of 
disease states such as inflammation, angiogenesis, tumor dispersion and metastasis, 
cardiovascular disease, neurological disease, and bacterial, parasitic, and viral infections. 

30 For example, caspases and components of caspase signaling pathways regulate apoptosis 
and/or inflammation in an individual. 
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Protease inhibitors and other regulators of protease activity control the activity 
and effects of proteases. Protease inhibitors have been shown to control pathogenesis in 
animal models of proteolytic disorders and in the treatment of HIV (Murphy, G. (1991) 
Agents Actions Suppl. 35:69-76). 
5 Table 1 lists several of the peptides described herein that appear, based upon 

structural homology, to be proteases. These peptides (and their corresponding source 
proteins) can therefore be used to treat disorders associated with inappropriate protease 
expression or activity. Examples of such disorder include immunological disorders 
(including autoimmune or inflammatory disorders), angiogenesis, tumor dispersion and 

10 metastasis, cardiovascular disease, neurological disease, and pathogenic infections, or to 
screen for agonists and antagonists useful for the same purpose. In addition, nucleic 
acids encoding the proteins as well as compounds (e.g., antibodies) that recognize the 
proteins can be used in a wide variety of applications described herein, including 
therapeutics, diagnostics, and drug screening. 

15 The invention therefore includes the following peptides as proteases: SEQ ID 

NO:75, SEQ ID NO:93, SEQ ID NO:163, SEQ ID NO:169. and SEQ ID NO:200. 

Transporters 

As described in Examples 1 and 2 (and the accompanying table), many of the 
20 peptides described herein are derived from proteins that appear to be transporters. 
Transporter proteins are used to facilitate the translocation of certain molecules either 
into or out of the cell. Often, such transporters work by "pumping" ions across the cell 
membrane and co-transporting specific molecules (e.g., amino acids, amino acid 
derivatives and precursors, dicarboxylates, or inorganic molecules) across the membrane. 
25 Such mechanisms play important roles in maintaining cellular and metabolic 

homeostasis, neuron function, signaling, and drug resistance. As such, transporter 
proteins constitute compelling targets for the development of novel therapeutic agents. 

The electrical potential of a cell is generated and maintained by controlling the 
movement of ions across the plasma membrane. The movement of ions requires ion 
30 channels, which form ion selective pores within the membrane. There are two basic types 
of ion channels, ion transporters and gated ion channels. Ion transporters utilize the 



34 



wo 02/078524 



PCT/US02/09671 



energy obtained from ATP hydrolysis to actively transport an ion against the ion's 
concentration gradient. Gated ion channels allow passive flow of an ion down the ion's 
electrochemical gradient under restricted conditions. Together, these types of ion 
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) 
5 electrical impulse conduction down the axon of a nerve cell, 2) transport of molecules 
into cells against concentration gradients, 3) initiation of muscle contraction, and 4) 
endocrine cell secretion. 

The etiology of numerous human diseases and disorders can be attributed to 
defects in the transport of molecules across membranes. Defects in the trafficking of 

10 membrane-bound transporters and ion channels are associated with several disorders, 
e.g., cystic fibrosis, glucose-galactose malabsorption syndrome, hypercholesterolemia, 
von Gierke disease, and certain forms of diabetes mellitus. Single-gene defect diseases 
resulting in an inability to transport small molecules across membranes include, e.g., 
cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease (vant Hoff, W.G. (1996) 

15 Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med 120:218-226; 
and Chillon, M. et al. (1995) New Engl. J. Med. 332: 1475-1480). 

Human diseases caused by mutations in ion channel genes include disorders of 
skeletal muscle, cardiac muscle, and the central nervous system. Mutations in the pore 
forming subunits of sodium and chloride channels cause myotonia, a muscle disorder in 

20 which relaxation after voluntary contraction is delayed. Sodium channel myotonias have 
been treated with channel blockers. Mutations in muscle sodium and calcium channels 
cause fonns of periodic paralysis, while mutations in the sarcoplasmic calcium release 
channel and muscle sodium channel cause malignant hyperthermia. Cardiac arrythmia 
disorders such as the long QT syndromes and idiopathic ventricular fibrillation are caused 

25 by mutations in potassium and sodium channels (Cooper, E.C. and L.Y. Jan (1998) Proc. 
Natl. Acad Sci. USA 96:4759-4766), All four known human idiopathic epilepsy genes 
code for ion channel proteins (Berkovic, S.R and IE. Scheffer (1999) Curr. Opin. 
Neurology 12:177 182). Other neurological disorders such as ataxias and hereditary 
deafness can also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin. 

30 NeurobioL 9:274-280). 
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Ion channels have been the target for many drug therapies. In particular, 
neurotransmitter-gated channels have been targeted in therapies for treatment of 
insomnia, anxiety, depression, and schizophrenia. Voltage-gated channels have been 
targeted in therapies for arrhythmia, ischemia, stroke, head trauma, and 
5 neurodegenerative disease (Taylor, CP. and L. S. Narasimhan (1997) Adv. Pharmacol. 
39:47-98). 

Various classes of ion channels also play an important role in the perception of 
pain, and thus are potential targets for new analgesics. These include the vanilloid-gated 
ion channels, which are activated by the vanilloid capsaicin, as well as by noxious heat. 

10 Local anesthetics such as lidocaine and mexiletine which blockade voltage-gated ion 
channels have been useful in the treatment of neuropathic pain. 

Ion channels in the immune system have been suggested as targets for 
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set 
of T-cell specific ion channels has been characterized that affect this signaling process. 

15 Channel blocking agents can inhibit secretion of lymphokines, cell proliferation, and 
killing of target cells. 

Table 1 lists several of the peptides described herein that appear, based upon 
structural homology, to be transporters. These peptides (and their corresponding source 
proteins) can therefore be used to treat disorders associated with inappropriate transporter 

20 expression or activity. Examples of such disorders include neurological, muscle, and 
immunological disorders, or to screen for agonists and antagonists useful for the same 
purpose. In addition, nucleic acids encoding the proteins as well as compounds (e.g., 
antibodies) that recognize the proteins can be used in a wide variety of applications 
described herein, including therapeutics, diagnostics, and drug screening. 

25 The invention therefore includes the following peptides as transporters: SEQ ID NO: 1 , 
SEQ ID NO:25, SEQ ID NO:48, SEQ ID N0:51, SEQ ID NO:52, SEQ ID NO:56, SEQ 
ID NO:58, SEQ ID NO:59, SEQ ID N0:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID 
NO:64, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID 
N0:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:90, SEQ ID 

30 NO:94, SEQ ED NO:100, SEQ ID N0:116, SEQ ID NO:128, SEQ ID NO:130, SEQ ID 
N0:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID N0:141, SEQ ID NO:170, SEQ ID 
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NO:178, SEQ ID NO:187, SEQ ID NO:189, SEQ ID NO:203, SEQ ID NO:207, SEQ ID 
NO:219, and SEQ ID NO:234. 

Cvtoskeletal Proteins 

5 As described in Examples 1 and 2 (and the accompanying table), many of the 

peptides described herein are derived from proteins that appear to be cytoskeletal 
proteins. The physical-biochemical processes of cell motility, organelle movement, 
chromosome movement, cytokinesis, and generation of cell shape are all dependent on a 
complex of protein fibers found in the cytoplasm. This protein complex is termed the 

10 cytoskeleton. The cytoskeleton of eukaryotic cells has three major filamentous systems. 
These systems are the actin filaments, intermediate filaments, and microtubules. Each of 
these filamentous systems is assembled from different proteins, including actin, myosin, 
tubulins, and intermediate filament proteins. Different cell types and tissues express 
specific isoforms of the proteins which comprise these filaments. In some cases distinct 

15 isoforms and mRNA splice variants are associated with cell-type specific functions 
(Lees-Miller, JP. and Helfman, D.M. (1991) BioEssays 13:429-437). 

Cell motility is governed by the interaction between cytoskeletal and other 
cellular proteins. Cytoskeletal proteins that are involved in the generation of motive 
force within the cell are termed contractile proteins. Cytoskeletal proteins are involved in 

20 the regulation of muscle contraction. Vertebrate smooth muscle contraction is dependent 
upon levels of cAMP and intracellular calcium ions. 

Cytoskeletal proteins are implicated in several diseases. Pathologies such as 
muscular dystrophy, nephrotic syndrome, and dilated cardiomyopathy have been 
associated with differential expression of alpha-actinin-3 (Vainzof, M. et al. (1997) 

25 Neuropediatrics 28:223-228; Smoyer, W£. and Mundel, P. (1998) J. Mol. Med. 76:172- 
183; and Sussman, M.A. et al. (1998) J. Clin. Invest. 101:51-61). Alpha actinin and 
several microtubule associated proteins (MAPs) are present in Hirano bodies, which are 
observed more frequently in the elderly and in patients with neurodegenerative diseases 
such as Alzheimer's disease (Maciver, S.K. and Harrington, C.R. (1995) Neuroreport. 

30 6:1985-1988). Actinin-4, an actin-bundling protein, appears to be associated with the cell 
motility of metastatic cancer cells. Other disease associations include premature 
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chromosome condensation, which is frequently observed in dividing cells from tumor 
tissue (Mumane, J.P.(1995) Cancer Metastasis Rev. 14:17 29), and the significant roles 
of axonemal and assembly MAPs in viral pathogenesis (Sodeik, B. et al. (1997) J. Cell 
Biol. 136:1007 1021). 

5 Table 1 lists several of the peptides described herein that appear, based upon 

structural homology, to be cytoskeletal proteins. These peptides (and their corresponding 
source proteins) can therefore be used to treat disorders associated with inappropriate 
cytoskeletal protein expression or activity. Examples of such disorders include ceU 
proliferative, inamunological, vesicle trafficking, reproductive, smooth muscle, 

10 developmental, and nervous disorders, or to screen for agonists and antagonists useful for 
the same purpose. In addition, nucleic acids encoding the proteins as well as compounds 
(e.g., antibodies) that recognize the proteins can be used in a wide variety of applications 
described herein, including therapeutics, diagnostics, and drug screening. 

The invention therefore includes the following peptides as cytoskeletal proteins; 

15 SEQ ID N0:118, SEQ ID NO:144, SEQ ID NO:177, SEQ ID NO:183, and SEQ ID 
NO:185. 

Receptors 

As described in Examples 1 and 2 (and the accompanying table), many of the 
20 peptides described herein are derived from proteins that appear to be receptors. 

Receptors are a broad category of proteins that specifically recognize other molecules. 
Many receptors are ceU surface proteins that bind extracellular ligands and produce 
cellular responses in the areas of growth, differentiation, endocytosis, and in^nune 
response. Other receptors facilitate the selective transport of proteins out of the 
25 endoplasmic reticulum and localize enzymes to particular locations in the cell. The 
propagation of cellular signals and the transport and localization of proteins rely upon 
specific interactions between receptors and a variety of associated proteins. Examples of 
families of receptors include: G-protein Coupled Receptors (GPCRs); MHC molecules; 
hormone receptors; and TNF receptor superfamily members. 
30 Receptor-mediated signal transduction is the process whereby cells conomunicate 

with one another and respond to extracellular signals via a series of biochemical events. 
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Extracellular signals are transduced through a biochemical cascade that begins with the 
binding of a signal molecule to a cell membrane receptor. The signal is propagated to 
effector molecules by intracellular signal transducing proteins and culminates with the 
activation of an intracellular target molecule. The process of signal transduction 
5 regulates a wide variety of cell functions including cell proliferation, cell differentiation, 
induction of immune responses, and gene transcription. 

Table 1 lists several of the peptides described herein that appear, based upon 
structural homology, to be receptors. These peptides (and their corresponding source 
proteins) can therefore be used to treat disorders associated with inappropriate receptor 

10 expression or activity. Examples of such disorders include inmiunological disorders 
(including autoimmune/inflammatory disorders) and cell proliferative disorders 
(including cancer), or to screen for agonists and antagonists useful for the same purpose. 
In addition, nucleic acids encoding the proteins as well as compounds (e.g., antibodies) 
that recognize the proteins can be used in a wide variety of applications described herein, 

15 including therapeutics, diagnostics, and screening. 

The invention therefore includes the following peptides as receptors: SEQ ID 
N0:1, SEQ ID NO:5, SEQ ID NO:6, SEQ ID N0:41, SEQ ID NO:44, SEQ ID NO:45, 
SEQ H) NO:46, SEQ ID NO:48, SEQ ID NO:49, SEQ ID N0:51, SEQ ID NO:53, SEQ 
ID NO:54, SEQ ID NO:55, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 

20 NO:60, SEQ ID N0:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID 
NO:66, SEQ ID NO:67. SEQ ID NO:69, SEQ ID NO:77, SEQ ID NO:78, SEQ ID 
NO:79, SEQ ID NO:80, SEQ ID N0:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID 
NO:84, SEQ ID NO:86, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID 
NO:91, SEQ ID NO:92, SEQ ID NO:94, SEQ ID NO:100, SEQ ID NO:104, SEQ ID 

25 N0:112, SEQ ID N0:121, SEQ ID NO:I22, SEQ ID NO:123, SEQ ID NO:128, SEQ ID 
NO:129, SEQ ID NO:133, SEQ ID NO:135, SEQ ID NO:143, SEQ ID NO:145, SEQ ID 
NO:147, SEQ ID N0:150, SEQ ID N0:161, SEQ ID NO:164, SEQ ID NO:166, SEQ ID 
NO:168, SEQ ID NO:170, SEQ ID N0:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID 
NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:180, SEQ ID NO:187, SEQ ID 

30 NO:188, SEQ ID NO:194, SEQ ID N0:211, SEQ ID NO:217, SEQ ID NO:218, SEQ ID 
NO:219, SEQ ID NO:221, and SEQ ID NO:230. 
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Transcription Factors 

As described in Examples 1 and 2 (and the accompanying table), many of the 
peptides described herein are derived from proteins that appear to be transcription factors. 

5 Regulation of gene transcription is the primary process by which a cell controls the 

appropriate expression of the multitude of genes necessary for growth and differentiation. 
The selective expression of genes at appropriate times is highly specialized in cells of 
multicellular organisms and permits the cells to perform "housekeeping" functions and 
respond to changes in theiij^ environment. These changes involve extracellular signals 

10 from a variety of sources such as honnones, neurotransmitters, and growth and 
differentiation factors. 

Gene transcription is controlled by proteins termed transcription factors. 
Transcription factors act by binding to a short segment of DNA located near the site of 
transcription initiation. Binding of a transcription factor to the target DNA activates 

15 transcription of the gene. Transcription factors contain a variety of structural motifs that, 
alone or in combination with one another, permit them to recognize and bind to the wide 
variety of target DNA sequences. 

One group of transcription factors, the TFUIA subclass of zinc-finger proteins, is 
characterized by an amino acid motif (a cysteine followed by two to four amino acids, a 

20 cysteine, twelve amino acids, a histidine, three to four amino acids, and a histidine) that 
interacts with zinc ions. The carboxyl terminus of the TFIHA proteins has three of these 
"zinc finger" motifs and specifically binds to DNA fragments containing a CACCC 
pattern. The amino-terminal portion of the I'h'lllA proteins is proline and serine-rich and 
can function as a transcriptional activator. TFUIA proteins are often important for the 

25 proper differentiation of tissues in which they are expressed. 

Table 1 lists several of the peptides described herein that appear, based upon 
structural homology, to be transcription factors. These peptides (and their corresponding 
source proteins) can therefore be used to treat disorders associated with inappropriate 
transcription factor expression or activity, or to screen for agonists and antagonists useful 

30 for the same purpose. Examples of such disorders include cancer, arthritis, and 
developmental disorders. In addition, nucleic acids encoding the proteins as well as 
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compounds (e.g., antibodies) that recognize the proteins can be used in a wide variety of 
applications described herein, including therapeutics, diagnostics, and screening. 

The invention therefore includes the following peptides as transcription factors: 
SEQ ID N0:2, SEQ ID NO:10, SEQ ID N0:14, SEQ ID N0:15, SEQ ID NO:22, SEQ 
5 ID NO:27, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:40, SEQ ID NO:43, SEQ ID 
NO:96, SEQ ID NO:102, SEQ ID N0:117, SEQ ID NO:120, SEQ ID NO:138, SEQ ID 
NO:177, SEQ ID NO:183, SEQ ID NO;184, and SEQ ID NO:208. 

Therapeutics 

10 As described above, many of the peptides of SEQ ID NOs: 1-235 belong to 

biological classes of proteins that have been implicated in a wide variety of disease 
conditions. These biological classes include kinases, phosphatases, receptors, proteases, 
transcription factors, transporters (such as ion channels), and cytoskeletal proteins. 
Additional biological classifications of many of the peptides of SEQ ID NOs:l-235 are 

15 detailed in the "biological class" column of the Table. Members of these additional 
classifications have also been characterized as being associated with specific disorders. 

In addition to disorders associated with discrete biological classes, many of the 
peptides of SEQ ID NOs: 1-235 were derived from transformed cells and thus may be 
involved in cellular proliferative and/or differentiative disorders, e.g., cancer. The 

20 Examples and associated table describe in detail the specific transformed cell lines with 
which the individual peptides of the application have been found to be associated. 
Because these peptides have been found to be translated in transformed cells, they are 
expected to be usefiil in therapeutic, diagnostic, and screening applications as described 
herein. For example, in one embodiment, a compound that modulates (increases or 

25 decreases) the expression or activity of a polypeptide containing any of SEQ ID NOs: 1- 
235 can be used to treat or prevent a cellular proliferative and/or differentiative disorder, 
e.g., a B cell cancer such as myelmoa, colon cancer, gastric cancer, adenocarcinoma, 
sarcoma, melanoma, lymphoma, or leukemia. 

In one embodiment, a polypeptide containing any of SEQ ID NOs: 1-235 (or a 

30 nucleic acid encoding such a polypeptide) can be administered to a subject to treat a 
disorder. For example, a disorder characterized by insufficient levels of a given 
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polypeptide, e.g., a phosphatase or an ion channel, can be treated by such a method In 
one example, a secreted protein described herein, e.g., a cytokine, is administered to a 
subject to treat a disorder. 

In one embodiment, antagonists or inhibitors of a polypeptide containing any of 

5 SEQ ID NOs: 1-235 may be administered to a subject to treat or prevent a disorder. In 
one aspect, antibodies specific for a polypeptide containing any of SEQ ID NOs:l-235 
may be used directly as an antagonist, or indirectly as a targeting or delivery mechanism 
for bringing a pharmaceutical agent to cells or tissue that expresses the polypeptide. 

The invention features a method for treating cancer comprising administering to a 

10 patient in need of such treatment an amount of a composition comprising a polypeptide as 
described herein in an amount sufficient to elicit an inmiunogenic response. Also, the 
invention features a method for treating a cancer patient, the method comprising 
administering to the patient an antibody that selectively binds to a peptide as described 
herein. 

15 In other embodiments, therapeutic proteins, antagonists, antibodies, agonists, 

antisense sequences or vectors may be administered in combination with other 
appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
therapy may be made by one of ordinary skill in the art, according to conventional 
pharmaceutical principles. The combination of therapeutic agents may act synergistically 

20 to effect the treatment or prevention of the various disorders described above. Using this 
approach, one may be able to achieve therapeutic efficacy with lower dosages of each 
agent, thus reducing the potential for adverse side effects. 

Antagonists or inhibitors of the polypeptides may be produced using methods 
which are generally known in the art In particular, purified polypeptides may be used to 

25 produce antibodies or to screen libraries of pharmaceutical agents to identify those which 
specifically bind the polypeptide. Cells expressing a nucleic acid of the invention can be 
screened against the same libraries to find agents that bind and/or affect the activity of the 
encoded polypeptide. 

An additional embodiment of the invention relates to the administration of a 

30 pharmaceutical composition, in conjunction with a pharmaceutically acceptable carrier, 
for any of the therapeutic effects discussed above. Such pharmaceutical compositions 



42 



wo 02/078524 



PCTAJS02/09671 



may consist of a polypeptide containing any of SEQ ID NOs: 1-235, antibodies to the 
polypeptide, mimetics, agonists, antagonists, or inhibitors of the polypeptide. In addition 
to the active ingredients, these phannaceutical compositions may contain suitable 
pharraaceutically-acceptable carriers comprising excipients and auxiliaries which 

5 facilitate processing of the active compounds into preparations which can be used 

pharmaceutically. Furtiier details on techniques for formulation and administration may 
be found in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing 
Co., Easton, Pa.), The compositions may be administered alone or in combination with at 
least one other agent, such as a stabilizing compound, which may be administered in any 

10 sterile, biocompatible pharmaceutical carrier, including, but not limited to, saline, 

buffered saline, dextrose solution, and water. The compositions may be administered to a 
patient alone, or in combination with other agents, drugs or hormones. 

Diagnostics 

15 In another embodiment, compounds (e.g., antibodies) that specifically bind to a 

polypeptide containing any of SEQ ID NOs: 1-235 may be used for the diagnosis of 
conditions or diseases characterized by expression of the polypeptide, or in assays to 
monitor patients being treated with the polypeptide, agonists, antagonists or inhibitors. 
Antibodies useful for diagnostic purposes may be prepared in the same manner as those 

20 prepared for therapeutic purposes. Diagnostic assays for a polypeptide containing any of 
SEQ ID NOs: 1-235 include methods that utilize the antibody and a label to detect the 
polypeptide in human body fluids or extracts of cells or tissues. The antibodies may be 
used with or without modification, and may be labeled by joining them, either covalently 
or non-covalently, with a reporter molecule. A wide variety of reporter molecules that are 

25 . known in the art may be used, several of which are described above. 

In another embodiment of the invention, a polynucleotide, e.g., a polynucleotide 
encoding a polypeptide containing any of SEQ ID NOs: 1-235, may be used for diagnostic 
purposes. The polynucleotides that may be used include oligoniicleotides, antisense 
RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 

30 quantitate gene expression in biopsied tissues in which expression of a polypeptide 
described herein may be correlated with disease. Hie diagnostic assay may be used to 
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distinguish between the absence, presence, and excess expression of an nnJRNA encoding 
a polypeptide containing any of SEQ ID NOs: 1-235, and to monitor regulation of mRNA 
levels during therapeutic intervention. 

A polynucleotide encoding a polypeptide containing any of SEQ ID NOs: 1-235 
5 may be used for the diagnosis of conditions or diseases that are associated with 

expression of the polypeptide. Examples of such conditions or diseases include cancers 
such as cancer of the testis, colon, prostate, uterus, cervix, ovary, lung, intestine, liver, 
breast, skin, heart, brain, stomach, pancreas, and spleen. The polynucleotide encoding 
the polypeptide may be used in Southern or northern analysis, dot blot, or other 

10 membrane-based technologies; in PGR technologies; or in dip stick, pin, ELISA or chip 
assays utilizing fluids or tissues from patient biopsies to detect altered mRNA expression. 
Such qualitative or quantitative methods are v^ell known in the art. 

As the peptides described herein were found to be translated in transformed cells, 
these peptides can thus function as markers for a transfonned cell, e.g., a cancer cell. As 

15 such, detection of polypeptides containing these peptides (or nucleic acids encoding the 
same) are particularly useful in the diagnosis of cellular proliferative and/or 
differentiative disorders such as cancer. 

Screening Assays 

20 The invention provides methods for identifying modulators, i.e., candidate or test 

compounds or agents (e.g., proteins, peptides, peptidomimetics, peptoids, small 
molecules or other drugs) which bind to a polypeptide containing any of SEQ ID NOsrl- 
235, have a stimulatory or inhibitory effect on, for example, expression or activity of the 
polypeptide, or have a stimulatory or inhibitory effect on, for example, the expression or 

25 activity of a substrate of the polypeptide. Compounds thus identified can be used to 
modulate the activity of target gene products in a therapeutic protocol, to elaborate the 
biological function of the target gene product, or to identify compounds that disrupt 
normal target gene interactions. 

The compounds that may be screened in accordance with the invention include, 

30 but are not Umited to peptides, antibodies and fragments thereof, and other organic 
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compounds that bind to a polypeptide containing any of SEQ ID NOs: 1-235 and increase 

or decrease an activity of the polypeptide. 

Such compounds may include, but are not limited to, peptides such as soluble 

peptides, including but not limited to members of random peptide libraries (Lam et al., 
5 Nature 354:82 [1991]; Houghten et al.. Nature 354:84 [1991]) and combinatorial 

chemistry-derived molecular libraries made of D- and/or L configuration amino acids; 

phosphopeptides (including but not limited to members of random or partially 

degenerate, directed phosphopeptide libraries; Songyang et al.. Cell 72:767 [1993]); 

antibodies (including but not limited to polyclonal, monoclonal, humanized, anti- 
10 idiotypic, chimeric and single chain antibodies; FAb, F(ab02 and FAb expression library 

fragments; and epitope-binding fragments thereof); and small organic or inorganic 

molecules. 

Other compounds that can be screened in accordance with the invention include 
but are not limited to small organic molecules that are able to gain entry into an 

15 appropriate cell and affect (1) the expression of the gene encoding a polypeptide 
containing any of SEQ ID NOs: 1-235 or (2) the activity of the polypeptide. 

As used herein "small molecules" include, but are not limited to, peptides, 
peptidomimetics (e.g., peptoids), amino acids, amino acid analogs, polynucleotides, 
polynucleotide analogs, nucleotides, nucleotide analogs, organic or inorganic compounds 

20 (i.e.,. including heteroorganic and organometallic compounds) having a molecular weight 
less tiian about 10,000 grams per mole, organic or inorganic compounds having a 
molecular weight less tiian about 5,000 grams per mole, organic or inorganic compounds 
having a molecular weight less than about 1,000 grams per mole, organic or inorganic 
compounds having a molecular weight less than about 500 grams per mole, and salts, 

25 esters, and other pharmaceutically acceptable forms of such compounds. 

Computer modeling and searching technologies permit identification of 
compounds, or the improvement of already identified compounds, that can modulate 
expression or activity of a polypeptide containing any of SEQ ID NOs: 1-235. Having 
identified such a compound or composition, the active sites or regions are identified. 

30 Such active sites might typically be a binding for a natural modulator of activity. The 
active site can be identified using methods known in the art including, for example, from 
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the anuno acid sequences of peptides, from the nucleotide sequences of nucleic acids, or 
from study of complexes of the relevant compound or composition with its natural ligand. 
In the latter case, chemical or X-ray crystallographic methods can be used to find the 
active site by finding where on the factor the modulator (or ligand) is found. 
5 Although described above with reference to design and generation of compounds 

that could alter binding, one could also screen libraries of known compounds, including 
natural products or synthetic chemicals, and biologically active materials, including 
proteins, for compounds which bind to a polypeptide containing any of SEQ ID 
NOs:l-235. 

10 In vitro systems may be designed to identify compounds capable of interacting 

with a polypeptide containing any of SEQ ID NOs:l-235. Compounds identified may be 
useful, for example, in the treatment of conditions such cellular proliferative and 
differentiative disorders, e.g., cancer. 

The principle of the assays used to identify compounds that bind to a polypeptide 

15 containing any of SEQ ID NOs: 1-235 involves preparing a reaction mixture of the 

polypeptide (or a domain thereof) and the test compound under conditions and for a time 
sufficient to allow the two components to interact and bind, thus forming a complex 
which can be removed and/or detected in the reaction mixture. The polypeptide species 
used can vary depending upon the goal of the screening assay. In some situations it is 

20 preferable to employ a peptide corresponding to a domain of the polypeptide fused to a 
heterologous protein or polypeptide that affords advantages in the assay system (e.g., 
labeling, isolation of the resulting complex, etc.) can be utilized. 

The screening assays can be conducted in a variety of ways. For example, one 
method to conduct such an assay involves anchoring a peptide (or polypeptide or fusion 

25 protein) or the test substance onto a solid phase and detecting peptide/test compound 
complexes anchored on the solid phase at the end of the reaction. In one embodiment of 
such a method, the peptide reactant may be anchored onto a solid surface, and the test 
compound, which is not anchored, may be labeled, either directly or indirectly. The 
invention features a peptide array comprising at least 100 peptides selected from the 

30 group consisting of peptides as described herein, each peptide linked to a solid support at 
a known location. Additionally, the invention features a collection of at least 10 
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polypeptide arrays, each array comprising at least 100 polypeptides as described herein, 
each peptide linked to a solid support at a known location. Peptide airays and methods 
for producing such arrays are described in, e.g., U.S. Patent No. 5,591,646. 

In practice, microtiter plates may conveniently be utilized as the solid phase. The 

5 anchored component may be immobilized by non-covalent or covalent attachments, Non- 
covalent attachment may be accomplished by simply coating the solid surface with a 
solution of the protein and drying. Alternatively, an immobilized antibody, preferably a 
monoclonal antibody, specific for the protein to be immobilized may be used to anchor 
the protein to the solid surface. The surfaces may be prepared in advance and stored. 

10 In order to conduct the assay, the nonimmobilized component is added to the 

coated surface containing the anchored component. After the reaction is complete, 
unreacted components are removed (e.g., by washing) under conditions such that any 
complexes formed will remain immobilized on the solid surface. The detection of 
complexes anchored on the solid surface can be accomplished in a number of ways. 

1 5 Where the previously non-immobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the previously 
non-immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g., using a labeled antibody specific for the 
previously non-immobilized component (the antibody, in turn, may be directly labeled or 

20 indirectly labeled with a labeled anti-Ig antibody). 

Alternatively, a reaction can be conducted in a liquid phase, the reaction products 
separated from unreacted components, and complexes detected, e.g., using an 
immobilized antibody specific for a polypeptide of the invention or the test compound to 
anchor any complexes formed in solution, and a labeled antibody specific for the other 

25 component of the possible complex to detect anchored complexes. 

Alternatively, cell-based assays can be used to identify compounds that interact 
with a polypeptide containing any of SEQ ID NOs: 1-235. To this end, cell lines that 
express the polypeptide, or cell lines that have been genetically engineered to express the 
polypeptide can be used. Cell based assays are particularly useful for evaluating the 

30 functional effects of a compound identified by a screen described herein. For example, 
once a compound is identified based upon its ability to bind to a polypeptide of the 
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invention, the compound can then be tested for its ability to, e.g., bind to and/or induce 
the selective killing of transformed cells. 

Use of Peptides and Nucleic Acids Encoding Peptides to Inhibit an Immune Response 

5 The MHC-binding peptides of SEQ ID NOs:l-235 and the nucleic acids encoding 

them can be used to block MHC class I and class H-mediated antigen presentation to T 
cells and thereby inhibit an immune response. Inhibiting an immune response can be 
particularly useful in conditions such as autoimmune disorders. Methods of using 
*T?locking peptides" to prevent MHC-mediated presentation of antigens to T cells are 

10 described in U.S. Patent No. 5,827,516. For a polypeptide, e.g., a fusion protein, 

containing an MHC-binding peptide sequence of any of SEQ ID NOs: 1-235, introduction 
of the polypeptide (or a nucleic acid encoding the polypeptide) to a cell is expected to 
result in the processing and presentation of the peptide sequence in the context of an 
MHC class I or class n molecule. 

15 Peptides described herein may be also useful for inhibiting an inmiune response 

when complexed with an MHC molecule, e.g., an HLA molecule, and administered to a 
host, e.g., a human. The use of HLA/peptide complexes to induce T cell 
nonresponsiveness has been described for the treatment of autoimmune conditions (see, 
e.g., Nag et al., 1996, Cell. Immunol. 170:25; Arimilli et al., 1996, Immunol. Cell. Biol. 

20 74:96; Prokaeva, 2000, Curr. Opin. Investig. Drugs 1:70). In addition, antibodies 
directed against HLA/peptide complexes may be useful in treating disease and/or 
blocking T cell activation. 

Use of Peptides and Nucleic Acids Encoding Peptides as References for MHC Class I 

25 and Class II Binding 

Some of the peptides of SEQ ID NOs: 1-235 have been characterized as binding to 
MHC class I or class n molecules (see Example section). These peptides, polypeptides 
containing them and nucleic acids encoding the same are therefore useful as references in 
evaluating the ability of a test peptide to bind to an MHC molecule. For example, a 

30 peptide described herein (a "reference peptide") can be used in a competitive assay 

wherein a test peptide is evaluated for its ability to compete with the reference peptide for 

48 



wo 02/078524 



PCT/US02/09671 



binding to an MHC molecule. The reference peptide can optionally be labeled, e.g., with 
a radioactive label, and displacement of bound label in the presence of a test peptide can 
be measured. Alternatively, the test peptide can be labeled. Competitive peptide binding 
assays using a reference peptide are described in, e.g., U.S. Patent 6,037,135. 

5 

Delivery Systems > 

The purified polypeptides, or complexes containing them (such as heat shock 
protein or MHC complexes), or isolated nucleic acids, can be administered using standard 
methods, e.g., those described in Donnelly et al. (1994) J. Imm. Methods 176:145, and 

10 Vitiello et al. (1995) J. Clin. Invest. 95:341. Purified polypeptides and/or isolated nucleic 
acids of the invention can be injected into subjects in any manner known in the ait, e.g., 
intramuscularly, intravenously, intraarterially, intradermally, intraperitoneally, 
intravaginally, or subcutaneously, or they can be introduced into the gastrointestinal tract 
or the respiratory tract, e.g., by inhalation of a solution or powder containing the 

15 polypeptides or nucleic acids. Alternatively, the purified polypeptides or isolated nucleic 
acids of the invention may be applied to the skin, or electroporated into the cells or tissue. 
Purified polypeptides or isolated nucleic acids of the invention may be electroporated 
with the delivery systems (e.g. microparticles, hydrogels and polymer networks) 
described herein. 

20 The purified polypeptides and isolated nucleic adds encoding polypeptides can be 

delivered in a pharmaceutically acceptable carrier such as saline, lipids, depot systems, 
hydrogels, networks, liposomes, particulates, virus-like particles, microspheres, or 
nanospheres; as colloidal suspensions; or as powders. The nucleic acid can be naked or 
associated or complexed with a delivery vehicle. For a description of the use of naked 

25 DNA, see, e.g., U.S. Patent No. 5,693,622. For a description of the use of encapsulated 
DNA see, e.g., U.S. Patent No. 5,783,567. For a description of the use of hydrogel and 
network delivery systems for DNA delivery see, e.g., USSN 60/262,219. Nucleic acids 
and polypeptides can be delivered using delivery vehicles known in the art, such as lipids, 
liposomes, ISCOMS, microspheres, microcapsules, microparticles, gold particles, virus- 

30 like particles, nanoparticles, hydrogels or networks, polymers, condensing agents, 

polysaccharides, polyamino acids, dendrimers, saponins, adsorption enhancing materials, 
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or fatty acids. Viral particles can also be used, e.g., retroviruses, adenovirus, baculovinis, 
adeno-associated virus, pox viruses, SV40 virus, alpha virus or herpes viruses. 

It is expected that a dosage of approximately 0.1 to 100 ^naoles of the 
polypeptide, or of about 1 to 200 pg of DNA, would be administered per kg of body 

5 weight per dose. As is well known in the medical arts, dosage for any given patient 
depends upon many factors, including the patient's size, body surface area, age, the 
particular compound to be administered, sex, time and route of administration, general 
health, and other drugs being administered concurrently. Determination of optimal 
dosage is well within the abilities of a pharmacologist of ordinary skill. 

10 Other standard delivery methods, e.g., biolistic transfer, or e% vivo treatment, can 

also be used. In ex vivo treatment, cells, e.g., antigen presenting cells (APCs), dendritic 
cells, peripheral blood mononuclear cells, or bone marrow cells, can be obtained from a 
patient or an appropriate donor and treated ex vivo with a composition of the invention, 
and then returned to the patient. 

15 Microparticles, including those described in U. S. Patent No. 5,783,567 and 

USSN 60/208,830, can be used as vehicles for delivering macromolecules such as DNA, 
RNA, or polypeptides into cells. Microparticles may also be made, for example, 
according to the methods of Mathiowitz, et al. as described in WO 95/24929, herein 
incorporated by reference. The microparticles can contain macromolecules embedded in 

20 a polymeric matrix or enclosed in a shell of polymer. Microparticles act to maintain the 
integrity of the macromolecule, e.g., by maintaining the DNA in a nondegraded state. 
Microparticles can also be used for pulsed delivery of the macromolecule, and for 
delivery at a specific site or to a specific cell or target cell population. 

The polymeric matrix can be a synthetic or natural biodegradable co-polymer 

25 such as poly-lactic-co-glycolic acid, starch, gelatin, or chitin. Microparticles that are less 
than 10 jiM in diameter can be used in particular to maximize dehvery of DNA 
molecules into a subject's phagocytotic cells. Alternatively, microparticles that are 
greater than 10 [jM in diameter can be injected or implanted in a tissue, where they form 
a deposit. As the deposit breaks down, the nucleic acid or polypeptide is released 

30 gradually over time and taken up by neighboring cells. 
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The purified polypeptides and isolated nucleic acids of the invention can be 
administered by using Ltmnune Stimulating Complexes (ISCOMS), which are negatively 
charged, cage-like structures of 30-40nm in size formed spontaneously on mixing 
cholesterol and Quil A (saponin), or saponin alone. A polypeptide (or analog) and 
5 nucleic acid of the invention can be co-administered with an ISCOM, or the polypeptide 
(or analog) and nucleic acid can be administered separately. The polypeptides and 
nucleic adds of the invention may also be electroporated into cells or tissues of a 
recipient. Electroporation may occur ex vivo or in vivo, 

10 Peptide Profiles and Databases 

U.S. Patent Application 09/372,380, the content of which is herein incorporated 
by reference, provides compositions and methods for the characterization of a cell's 
protein repertoire and the storage and manipulation of that information in a computer 
database. A characteristic profile or fingerprint of peptides or polypeptide ligands can be 

15 generated, for example, for a given cell type, for diseased vs. normal cells, and for 
different metabolic or developmental states of a cell. Appropriate comparisons of the 
profiles can be used to identify cellular targets useful in diagnostics, drug screening and 
development, and delivering therapeutic regimens. The EPTs described herein, the 
MHC-binding peptides of SEQ ID NOs: 1-235, represent a population of polypeptide 

20 ligands that can be used in the methods, ligand profiles, and databases described in USSN 
09/372,380. In addition to EPTs, all of the peptides described herein can be used to 
catalogue and profile the protein composition of a cell. The following are several non- 
limiting examples of uses of the peptides for identifying, cataloguing and profiling the 
protein composition of a cell. 

25 Peptides and proteins from which they are derived can be used to identify, 

catalogue and characterize most or all proteins expressed within a cell for any given cell 
type, metabolic or developmental stage, and disease vs. normal state, or in response to a 
test substance such as a given hormone, growth factor, transcription factor, cytokine, 
small molecule, polypeptide, nucleic acid, carbohydrate or lipid. The approach can also 

30 identify differences between transgenic vs. non-transgenic cells, or transfected vs. non- 
transfected cells. As such, the invention relates to the identification of "polypeptide or 
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peptide profiles" of a cell type of interest. These profiles can be used to pre-sort cellular 
proteins for "proteomics" analysis, greatly reducing the screening effort and increasing 
the efficiency of identifying cellular proteins involved in developmental and metabolic 
disease processes. Appropriate comparisons of the profiles can be used to identify 

5 cellular targets useful in diagnostics, drug screening and development, and for developing 
therapeutic regimens. Such data mil facilitate the identification of proteins that have 
biological significance to a particular cellular state, e.g., in metabolism, maturation, 
development, disease or treatment. 

Peptide esterification methods such as those described in U.S. Provisional 

10 Application No, 60/284,416, filed April 16, 2001, the content of which is herein 

incorporated by reference, can be used to detennine relative protein quantities in different 
cells or tissues. 

Peptides of the invention can be used for comparative purposes. A distinct 
peptide profile, e.g., an EPT profile, can be generated for each cell of interest The 

15 profiles of different ceUs, tissue or organ types of interest may be compared, and 

polypeptides may be identified that are differentially represented, e.g., present in one type 
of cell/tissue/organ, but absent from another, or expressed v/ith different abundance. 
Furthermore, "differential profiles" of polypeptides may be generated representing 
peptides that are differentially present in tiie two types of cells. 

20 Peptides described herein can be used to verify or confirm the distinct profile of a 

cell of interest In this use, polypeptides from cells that are essentially identical are 
isolated and compared. Comparison of the peptide profiles confirms that they are 
essentially identical, and together represent a reproducible ligand profile for the given 
cell type. For example, information can be obtained if the peptide profile or set of 

25 profiles that represents polypeptides derived from two or more types of MHC molecules 
in the given cell type are compared. For example, a subtraction profile of polypeptides is 
generated from comparing polypeptides isolated from two or more types of MHC 
molecules. 

A first cell sample and a second cell sample of interest may be obtained from 
30 different types of biological tissue (e.g., comparing smootii muscle tissue to skeletal 
muscle tissue), different cell types (e.g., endothelial cells and epithelial cells), different 
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organ systems (e.g., pancreas and lung), or the same organ system but cells of different 
status (e.g., terminally differentiated vs. embryonic, or healthy vs. diseased or 
predisposed to a disease). Alternatively, one can compare transfected cells which express 
a particular recombinant nucleic acid versus non-transfected cells or transfected cells 
5 which do not currently express the recombinant nucleic acid. One could also compare 
cells treated in a particular way (either in vivo or in vitro) vs. cells treated in a different 
way, or untreated 

For example, a treatment may involve administration of a test substance or drug 
candidate such as a growth factor, a hormone, a cytokine, a small molecule, a 

10 polypeptide, a nucleic acid, a carbohydrate, or a lipid. Alternatively, a treatment may 
involve exposing the cells to stress conditions such as trauma, hypoxia, deprivation of 
glucose, deprivation of an amino acid, deprivation of a nutrient, presence of a toxin, or 
low or high temperature. The cells are preferably vertebrate cells (e.g., from a bird or 
fish), and more preferably mammalian cells, e.g., from a human or from a non-human 

15 animal such as a non-human primate, a mouse, rat, guinea pig, hamster, rabbit, dog, cat, 
cow, horse, pig, sheep, or goat. By using a third cell sample, one could compare three 
different ceU samples, or compare the first sample to the second and to the third. For 
example, the second cell sample could be a positive control and the third cell sample a 
negative control, or the three cell samples could represent three different treatment 

20 regimens. 

In a variation on the above, one can simply compare the proteins expressed in a first 
cell sample to those expressed in a reference cell sample, by generating a peptide profile, 
e.g., an EPT profile, that is compared to an appropriate reference peptide profile. One 
compares first peptide profile to the reference peptide profile, in order to identify 

25 differences or similarities between the first cell sample and the reference cell sample. This 
and the other comparison methods described above can be used to compare, for example, 
cells cultured in the presence of a test compound to cells not cultured in the presence of the 
test compound; or cells from an animal treated with a test compound to cells (1) from the 
same animal before the treatment, or (2) from a second animal not treated 

30 Differential peptide profiles can be generated for cells of interest where one 

peptide profile consists of a subset of polypeptides that is differentially present in two (or 
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more) distinct cell types, disease stages, developmental stages, metabolic stages, cell 
cycle stages, treatment regimens, etc., of interest. As such, the differential profiles 
represent a repertoire of peptides that may directly or indirectly be involved in the 
different cellular phenotypes or behavior. Consequently, the differential profiles provide 

5 a valuable tool for the characterization of cell-type and/or phenotype-specific protein 
expression, and for the identification and/or the isolation of known or novel gene 
products and their respective coding sequences that are potentially involved in biological 
processes, such as developmental processes, establishment and progression of disease, 
predisposition to disease, organ development, signal transduction, differentiation, 

10 neurogenesis, etc., or in response to environmental factors or treatments. For example, 
the polypeptides identified as differentially expressed may be further characterized by 
determination of their chemical structure: i.e., sequence. Thus, the present invention 
provides for the characterization of differential expression, eg., the presence or absence, of 
gene products encoded by known genes and/or ESTs with unknown function. Tlie present 

15 invention thus can be used as an easy and efficient way to assign to previously identified 
genes or gene products a putative function and/or involvement or association with a 
particular developmental pathway, metabolic pathway, or disease stage. With this 
information, new targets for the development of gene therapy approaches and dmg 
development noay rapidly be identified. 

20 Peptide profiles for a given cell, tissue or organ of interest can be generated and 

stored in a database. The compilation of data can then be used for a number of applications. 
First, they are used as a reference point for a human patient's or animal's sample for the 
diagnosis of disease, progression of disease, and predisposition for disease. For example, if 
a disease is associated vwth changes in protein composition in certain cells, organ systems, 

25 cell sources, or tissue types, a suitable patient sample may be used to generate a protein 
profile, and compared with profiles of corresponding samples of normal (non-diseased) 
and/or diseased origin to assess presence or absence of, progression of, and/or predisposition 
to the particular disease in question. A large number of diseases may be diagnosed this way, 
including diseases for which particular aberrations in protein expression are known, 

30 including, but not limited to metabolic diseases that are associated with lack of certain 
enzymes, proliferative diseases that are associated with aberrant expression of, e,g.. 



54 



wo 02/078524 



PCT/US02/09671 



oncogenes or tumor suppressors, developmental diseases that are associated with aberrant 
gene expression, etc. Furthermore, the peptide profiles can be used for the diagnosis of 
diseases or other aberrations based on pre-determined differences in EPT profiles. Thus, if 
it is pre-detennined that a given disease of interest is associated with certain changes of the 

5 peptide profile of a particular type of cell, tissue, cell source, or organ system, a human 
patient or animal may be diagnosed based simply on its individual profile when compared to 
the profiles provided by a database. 

Second, peptide information can be used to detect protein translation cell, cell 
sample, or tissue sample. Such techniques can complement the detection of mRNA and be 

10 used to detect specific protein translation (particularly in diseased tissues). 

Third, the information stored in a database may be used to identify genes and their 
products that are involved in the manifestation of, progression of, or predisposition to any 
disease of interest, and with the development of symptoms of a particular disease. For 
example, peptide profiles of a diseased organ, tissue or cell type may be generated and 

15 compared with the corresponding profile counterpart obtained from a non-diseased 
sample. Differences in the profile may be identified, and individual peptides that are 
differentially present in the diseased vs. the non-diseased sample may be identified and 
isolated for further analysis. The identified differences in the peptide profiles are useful 
for future diagnosis of the disease or aberration, 

20 

Generating Peptide Profiles for Different Developmental, Metabolic or Disease Stages of 
a Given Type of Cell 

Peptide profiles for cells of different developmental, metabolic or disease stages, 
can be generated and compared to identify differences in protein or gene expression. For 

25 example, the profiles of a cancer cell and non-cancerous cell derived from the same 
genetically matched tissue may be generated and compared. Proteins differentiaUy 
expressed in diseased and non-diseased cells can convenienfly be identified, and their 
involvement in disease development and progression analyzed by methods well known in 
the art In this way, new targets for the treatment of the disease are efficiently identified. 

30 Alternatively, peptide profiles of cells of different developmental stages can be 

generated and compared. For example, profiles of embryonic cells and adult cells derived 
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from genetically matched tissue may be generated and compared to identify genes and their 
products that play a role in developmental processes, and that may be useful for the 
development of, e,g,, novel gene therapy or other therapeutic approaches for the treatment of 
developmental disorders. 

5 In another example, peptide profiles of (a) cells infected with a selected pathogen, 

e.g., microorganism, virus, retrovirus, or prion, and (b) corresponding non-infected cells are 
generated and compared to identify genes and gene products that are turned on or off in 
response to the infection. Alternatively, instead of being infected, the first cell can be made 
to take up a foreign protein or immunogenic substance, etc. This approach allows one, e.g., 

10 to identify factors produced by the cells in response to infection or introduction of the 
foreign substance that could be useful for therapeutic purposes. 

In another example, peptide profiles from cells derived from individuals having a 
selected genetic disorder and individuals that do not have such disorder are generated and 
compared. Preferably, samples from affected and non-affected family members are used 

15 for the generation of the profiles. Depending on the particular genetic disorder chosen, 
cell or tissue types that are known to be affected by the particular genetic disorder are 
studied. In many cases, profiles of various cell and/or tissue types will be generated and 
compared. This example allows one to identify genes and proteins associated with a 
genetic disorder. The information obtained may be useful for the development of gene 

20 therapy and other therapeutic approaches and for the development of targeted drugs that 
interfere with the expression of genes or activity or stability of gene products that are 
involved in the symptoms of die genetic disease. Furthermore, this example allows 
selection of diagnostic targets for the identification of individuals predisposed for certain 
types of disease or disease symptoms. 

25 

Generation of Peptide Profiles Correlated to Response of a Given Cell Type to External 
Factors 

In one example, a peptide profile of a given cell type treated with an external 
factor is generated and compared to a profile of cells of the same type which have not 
30 been so treated, to identify differences in protein expression. The cells can be 
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recombinant or native, a cell line or non-transformed cells, or isolated directly from an 
animal before and after treatment of the animal with the compound. 

For example, peptide profiles of cells of a selected origin or nature that have been 
contacted with a growth factor, cytokine or hormone, and cells that have not been 
5 contacted with the substance, but otherwise treated the same way, are generated and 
compared. This allows identification of genes and gene products that are turned on or 
turned off in response to the growth factor, cytokine or honnone, which will give, e.g., 
valuable insight in cellular signal transduction pathways and regulation of protein 
expression. 

10 Similarly, peptide profiles of cells that have been treated with or exposed to a 

polypeptide, small molecule, chemokine, or nucleic acid drug or drug candidate, and cells 
that have not been treated with or exposed to the substance, but have otherwise been 
treated the same way, are generated and compared. This allows one to identify the 
effects of the selected substance on protein expression in the cell, and is, for example, an 

15 excellent tool for the validation of particular drugs or the identification of drugs 
associated with expression of a selected gene or gene product. 

In another example, peptide profiles of cells that have been exposed to a selected 
type of compound, e.g., a selected carbohydrate or group of carbohydrates, lipid or group of 
lipids, amino acid or group of amino adds, nucleotide or nucleoside or group of either, or 

20 vitamin or group of vitamins, and cells that have not been treated with the compound, but 
have otherwise been treated the same way, are generated and compared. This allows one to 
identify the effects of the selected compound on the gene and protein expression of the cell, 
and will give valuable insight into metabolic processes. 

In another example, peptide profiles of cells that have been treated with a selected 

25 nucleic acid, e.g., a selected antisense oligonucleotide, a ribozyme, an expression vector, a 
plasmid, an RNA, or a DNA, and cells that have not been treated with the nucleic acid, but 
have otherwise been treated the same way, are generated and compared This allows one to 
identify the effects of the antisense oligonucleotide or other nucleic acid on the protein 
expression in the cell, and as such allows one to evaluate the efficacy or effect of the 

30 antisense oligonucleotide or nucleic acid. Antisense molecules and ribozymes of the 
invention may be prepared by any method known in the art for the synthesis of nucleic 
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acid molecules. These include techniques for chemically synthesizing oligonucleotides 
such as solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules 
may be generated by in vitro and in vivo transcription of DNA sequences encoding 
polypeptides. Such DNA sequences may be incorporated into a wide variety of vectors 

5 with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
constructs that synthesize antisense RNA constitutively or inducibly can be introduced 
into cell lines, cells, or tissues. 

Finally, peptide profiles of cells that have been subject to a selected stress 
condition, such as low or high temperature, hypoxia, oxidative stress, free radical- 

10 induced stress, deprivation of nutrients such as glucose, amino acids, or other essential 
factors, or presence of a toxin, are generated and compared to a peptide profile generated 
in untreated controls. Differentially expressed gene products are identified in order to 
give valuable insight into factors involved in cellular stress responses. This example 
provides an extremely valuable and efficient way to detennine and/or evaluate the effect 

15 of a selected compound on protein e^tpression in the cell. The technique may furthermore 
be useful to verify a desired shut-down of certain enzymatic activities, e.g., by 
distinguishing between phosphorylated and non-phosphorylated, or glycosylated and - 
non-glycosylated, peptides and/or proteins. It can also be used to aid in pharmacological 
and/or toxicological assessment of potential new drugs, and in screening for such drugs, 

20 

Generating Peptide Profiles for Different Organ Svstems 

Peptide profiles of cells derived from different organs or organ systems may be 

generated and compared to identify differences in protein or gene expression. For 

example, EPT profiles of cells derived from lung, liver, heart, spleen, skin, brain, kidney, 
25 thymus, intestine, and/or colon can be generated and compared. Differentially expressed 

genes and proteins are thus identified. This example is usefiil to identify proteins that are 

involved in an organ's particular physiological function, 

Tn another example, peptide profiles of selected tissue or cell types, e,g., muscle, 

endothelium, epithelium, neuronal, fat, ovarian, testicular, blood, bone marrow, and/or 
30 mammary tissue, etc., are generated, compared, and differentially expressed proteins 
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identified. This will give valuable insight into a proteins involvement in a tissue or cell 
type's physiological function. 

Generating Peptide Profiles for Expression Studies in Standard Cell Lines 
5 Peptide profiles of cells derived from differentially engineered standard cell lines 

can be generated and compared to identify differences in protein expression. For 
example, peptide profiles of standard cell lines that have been engineered to 
express/overexpress one or several selected recombinant genes, genes encoding a 
selected growth factor receptor or other signal transduction component, transcription factor, 

10 oncogene, apoptosis-inducing gene, etc., are generated and compared to peptide profiles 
prepared firom a reference cell line of the same origin, but which does not carry and express 
the selected recombinant gene. Differentially expressed genes and gene products are 
identified This will allow one to identify the impact of the overexpressed gene on the " 
expression of other polypeptides in the cell. 

15 The following examples are not to be construed as limiting the scope of the 

invention in any way. 

EXAMPLES 

20 Example 1: Isolation and Characterization of MHC Binding Peptides (EFTs) 

This example describes peptides identified by the immunoaffinity purification of 
class I and class II HLA molecules, followed by acid extraction and solid phase 
extraction of the EPT repertoire, reversed-phase HPLC separation, and mass 
spectrometry analysis. Methods used to derive the peptide sequences disclosed in this 

25 example are described in detail in U.S. Patent Application 09/372,380, filed 

August 11, 1999, the content of which is herein incorporated by reference. The various 
HLA molecules from which peptides were extracted are detailed in Table 1. 

Table 1 describes each of the peptides according to five criteria, as follows: (1) 
SEQ ID NO; (2) a numeric code corresponding to cell line and HLA type; (3) SEQ ID 

30 NOs of source protein reference(s); (4) source protein symbol; and (5) a function key 
corresponding to biological classification(s). 
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The SEQ ID NO for each peptide in Table 1 is Criteria 1. The other criteria 
follow to the right of the peptide sequence and are separated by a vertical hatch divider. 
Each new peptide entry begins on the next consecutive line having the next consecutive 
SEQ ID NO. 

5 Criteria 2 of Table 1 identifies a peptide according to the cell type and HLA type 

from which it was derived. A numeric code has been assigned to each combination of 
cell type and HLA type. The numeric code is as follows: 



NUMBER 


CELLJLdSIE r 


HLA_tyPE:: , 


1 


721.221 


All 


2 


721.221 


Al 


3 


721.221 


A24 


4 


721.221 


A3 


5 


721.221 


DRl 


6 


721.221 


PAN-DR 


7 


IM9 


A2 


8 


IM9 


N/A 


9 


JY 


A2 


10 


JY" 


B7C7 


11 


JY 


DR4_13 


12 


JY 


DR4 


13 


KATOm 


A2 


14 


KATOm 


CLASS 1 


15 


KATOm 


N/A 


16 


KATOm 


PAN-DR 


17 


LS174T 


A2 


18 


LS180 


A2 


19 


LS180 


CLASS 1 


20 


LS180 


PAN-CLASS 2 


21 


N/A 


All 


22 


NORMAL PBMC 


A2 


23 


NORMAL PBMC 


CLASS 1 


24 


PRESS 


A2 


25 


PRIESS 


DR4 


26 


PRIESS 


PAN-DR 


27 


SW403 


A2 


28 


SW480 


A2 


29 


U266 


A2 
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IM-9 is an EB V-transformed B lymphoblastoid cell line derived from the 
peripheral blood of a patient with multiple myeloma. This cell line is described in, e.g., 
Fahey et al. (1971) Ann. N.Y. Acad. Sci. 190: 221-234. 

U266 is a B lymphocyte cell line established from tissue obtained from a patient 
5 with myeloma. This cell line is described in, e.g.. Nilssonet al. (1970) Clin. Exp. 
Immunol. 7:477-489. 

LS180 is a hunaan colorectal adenocarcinoma cell line. The cell line is 
tumorigenic in nude naice. This cell line is described in, e.g., Tom et al. (1976) In Vitro 
12:180-191. LS174T is a trypsinized variant of LS180. 
10 SW403 and SW480 are human colorectal adenocarcinoma cell lines. The cell 

lines are tumorigenic in nude mice. The cell lines are described in, e.g., Fogh et al. 
(1977) J. Natl. Cancer Inst. 59:221-226. 

KATO in is a human gastric cancer cell line. The cell line is described in, e.g., 
Yamamoto et al. (1996) Cancer 77:1628-33. 
15 JY is a human lymphoblastoid cell line. The cell line is described in, e.g., J. Biol. 

Chem. (1979) 254:8709, J. Biol. Chem. (1975) 250:4512, and Proc Natl Acad Sd USA 
(1979)76:2273. 

721.221 is a human lymphoblastoid cell line that has been mutagenized to 
eliminate the expression of HLA-A, -B, and -C alpha chains. The cell line is described 

20 in, e.g., Shimizu et al. (1988) Proc. Natl. Acad. Sci USA 5:227-231. The 721.221 cell 
lines described herein were transfected with a nucleic acid encoding an individual MHC 
molecule, e.g., HLA-Al, -A2, -A3, or -All. 

Priess is a human B-lymphoblastoid cell line. The ceU line is described in, e.g., 
Hanania et al. (1983) EMBO J 2:1621-1624. 

25 The SEQ ID NOs of the source protein reference(s) for a given peptide are 

described as Criteria 3 of Table 1. "Source protein" refers to an amino acid sequence or 
predicted amino add sequence contained in a publicly available nucleotide and/or protein 
database having a region identical to an EPT sequence. In some cases, a "source protein" 
may not actually represent a protein from which a peptide is derived, but merely a protein 

30 (or predicted protein) containing a sequence identical to that of an EPT sequence. 

Peptides can be referenced to multiple different source proteins. The list of all identified 
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source proteins for any one peptide is listed in Table 1. The sequences corresponding to 
the SEQ ID NOs of the source proteins are in the accompanying sequence listing. 

The amino acid sequence for each of the source proteins was derived from NCBI 
(www.ncbi.nlm.nih.gov/PubMed/). The entire content of this reference is herein 
incorporated by reference. 

Criteria 4, "source protein symbol " provides the symbol identifying die source 
protein. Proteins may have been identified by different protein symbols in which case the 
different protein symbols for the source protein have been listed. Symbols are obtained 
from three places in the following order: (a) gene symbol{s) and alias(es) from Locus 
link; (b) gene name(s) from LocusLink; or (c) Locus titles from LocusLink 

Criteria 5, entitled "biological classification," provides a numeric key 
representing functional classifications for the peptide sequences. Several of these 
biological classes are described in detail in the application. All known biological 
classifications for a particular peptide are hsted in Table L The numeric key 
corresponding to the biological class is as follows: 



f 
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1 
i 


i V A>NrJ .f. 1 1 JIM 


L 




% 

J 




A 




C 

J 


DMA PTTsTnixm 

KIN A X3liNi>'liNLl 


0 


Kits UlN UUiJtiUrKU 1 liiiN 


/ 


■nXT A "RTMnTNTn PP riTTnTW 
UiN A-j51lNi->liN O r KU 1 lillN 


Q 

0 




Q 


TP AXrcp'PTDTmXT C^C\ P"DPPT3CCr\P 




POT TT TP A XTCr^PTPTTOXT 

r UJL» il 1 KAJN oUKlr 1 lUlN 


11 


UJNA rAClsALrliNLr 


1/ 


TP A TVTC Li CP A OTZ? 

1 KAJN ox*jQKAoxi 


13 


i^TTT>i^\A KnrVKl}r^Ty\^r\\Xr\Qr\\XX3 CTPTTr'*TTTPT7 

CliKUMAllN/CJlKUMUoUJVlb olKU UKJti 


1 A 

14 


IKANoCKlrllUiN KJiUULAllUJN 


Id 


UTCTOKTTJ A i TVT TP A TSTCT7CP A QTh 

rllo 1 UiNrS AL^iil 1 1^1 KJVINoriiKAoli 


10 


TP AMCr^PTPTTriXr Ar^m/ATTW/^ paptt^p 
IKAJNol^KLr IIUJN AL^llVAlUNU rAUlUK 


1 / 


A rmrvT" A top 
AdlVAlUK 


lo 


rNTTTroiTOP OP PTTPP'CQCOP 

lINxiLbliUK UK KJcJrKJiooUK 


1 0 
IV 


TP AXTCPOPTPP 
1 KAJNor UK 1 JiK 




POT" A C CTT TKH TP A XTQPOP T 
rUlAoolUM 1 KAJN or UK 1 


Zl 


CTTT "nOMVT TTPU A PPHTTPTOP 

^ UU^UJN 1 LrUKxiA KcL-xir 1 UK 


oo 
zz 


niT A XTKTPT rP A C CTVTO TP A XTCPOPTT^P 1 

LrlAJNINliLt LJrADolVrS IKAJNorUKlliKJ 


15 


TD A XrC/^PTPTTOM TJ A OTOP 

IKAJNoUKlrllUJN rAUlUK 


Z4 


PXTOOPtlOPVPTTXr ATC XJVT^P ATA CT7 

r HUorriUr x KU V A 1 li M i UKA 1 AoJa 


ZD 


PTJPPTJCOTOXT OTJ TP A XTCnPTPTTOKT T7P OA A POT TT PP OAAOTT3P 

KjirKboMUJN Ur IKAJNoUKLrllUJN rKUJvl rUL» 11 IrKUJVlUlJiK 


26 


LYASE 


27 


LEARNING AND MEMORY 


28 


SYNAF IIL TRAJNbMloolUN 


29 


SMALL MOLbCULii IKANorUKl 




f^t T TT A X/T A TTj CT/^XT ATT TXT/T. P A TUTS/ A V 

uLUlAMAlb MljNAJLLlNvj rA 111 WAY 


31 


TVPTTJ/^P AT TIT A CX^T A XATTXVTPP A XTD D"DOTT7TXT 

IN 1 liCjRALr FLASMA MnlVLDKAiNE rKU 1 bUN 


jZ 


XT XALt'TUVT T\ A CP A PT ATT3 CTTT TT/^TTV/TT ^JT T TT A ATT? 

N-AlblxliLrLi-AorAKlAlll orJ ..KL^llVc OL.U1 AMAlli 

P T7PT7PTOP 
KC^-^rVr i UK 


DD 


MThTTDOXTAT TP AXTQlwrrCQTOXJ 




RpnPPTn"R ^^TfrMAT T \HC\\ 

i\S2i\^S2JC X yJSs. ^OXVJiN x\XJLiXi> \J } 






36 


ONCOGENESIS 


37 


BASEMENT MEMBRANE 


38 


NUCLEAR CHROMOSOME 


39 


bNA MEDIATED TRANSFORMATION 


40 


CHONDROmN SULFATE PROTEOGLYCAN 


41 


CHROMOSOME ORGANIZATION AND BIOGENESIS 


42 


kflTOCHONDRION 
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43 


AA3-TYPE CYTOCHROME-C OXIDASE 


44 


ENERGY GENERATION 


45 


CYTOSOL 


46 


5 -NUCLEOTIDASE 


47 


HYDROLASE 


48 


METAIXOENDOPEPTIDASE 


49 


PROTEOLYSIS AND PEPTIDOLYSIS 




NFUROMUSCULvAR JUNCTION DEVELOPMENT 




PROTEASE f OTHER THAN PROTEASOMAL'y 




TRANSCRIPTION CO-FACTOR 

X xvnx ^ vJ v^xvxx xxvyx^ x.nv*xvyxv 




TR ANSCRTPTTTON FROM POL 11 PROMOTER 

± JXJTJL^ wv^x\j_r xx\-/i^ X xvvyJtTX X KJi^ XX X xvvyxTxv^ xx-txx 




RNA POLYMERASE 11 TRANSCRIPTION FACTOR 

fXVl^x* X \Jx^ X 1VX.I jxvniJX-f XX X xvrxx ^i..7>^xxxx x x vx ^ x xx.v-- x v-^xv 


SS 


I^GATTVE CONTROL OF CELL PROLIFERATION 

L TXJ\Jxxi X A V x^ ^vyx T X xv\-/x-# vyx ^✓i <i / x xv>^x-«xx x^^v^^x xv.'x ^ 




CONTROL OF CELL PROLIFERATION 


J / 


PFTT PRO! TFFR ATTON 


JO 


t^FVET OPMFNTAL PROCESSES 

xJl-i V f «.l.i*vX IVXJUuL 1 A xTJU* X XW^X-AJwJX^U 




DIFFERENTIATION 

uxx. X x^xvi-fi^ X xj^ X xvyx 1 




DNA REPAIR 


61 


DNA REPLICATION 

X^XtjeX XvX^X X^XV-n'i* X XV^X ^ 


62 


CFI L CYCLE CONTROL 


6"^ 


DELTA-DNA POLYMERASE COFACTOR 


64 


PROLIFERATING CELL NT TCI FAR ANTIGEN 


6S 


DNA SYNTHESIS 

L^k^Ix X X^ X XxJ <iJX>J 


66 


DNA POLYMERASE OR SUBUNTT 

L^iTiri- X v^JL^ X xvxjL^xvrx.w/x^ vx\. yjx^xx. 


67 


CYTOPIASM 


6R 
uo 


*?OT I TBI E FRACTION 

o \-/x^ w xj x^ 1 J X xvrx^^xxv^x^ 


60 


PR OTFTN Rin^YNTRFSTS 

ri\^V ' IwM^ JOXVyO X 1^ XXXJLfOXO 


70 


TR YPTOPHANYT^TRNA BIOSYNTHESIS 

X JV XXX V>/X XXx^LL~ X X>/^ XXvX^Xl^ XJXVi/ikJ X X XXJ /UXU 


71 




79 


PR OTFTN SYNTHESIS 

X xvVy X J-iXIN O X 1^ X XJLL>OXi3 


7'^ 


TRNA SYNTHETASE 

XXVi^i^ W> X 1^ X XXX^X i^OX-< 


74 


RNA-BINDING PROTEIN 

XxJ. izx xJXX^xyxi ^ V— ' X xvvyxx^xxi 


75 


RNA PROCESSING/MODIFICATION 

xxi^xi X xv\^Vji'X^c?wXi 1 ^j/XTxVi'x-'XX xVp'** X xvj^x^ 


76 


LIPID METABOLISM 

l^LI XX^ XVXU X X»,X/ V^X^XwXTX 


77 


AMINOPHOSPHOLIPID TRANSPORT 

f^LLVXXX^VyX XXX^L^X XXV./X-*XX 11^ XXX^XX^UX V^J.X.X 


78 


AMINOPHOSPHOLIPID-TRANSFORTING ATPASE 


79 


ACnVE TRANSPORTER, PRIMARY 


80 


AGEING 


81 


HEUCASE 


82 


DNAHEUCASE 


83 


3'-5'EXONUCLEASE 


84 


ADENOSINETRIPHOSPHATASE 


85 


AGING 


86 


NUCLEASE [ENDO, EXO, RIBO, DEOXYRIBO] 
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87 


PROTEIN PHOSPHORYLATION 


88 


NON-MEMBRANE SPANNING PROTEIN TYROSINE KINASE 


89 


SIGNAL TRANSDUCTION 


90 


CHROMATIN BINDING 


91 


EMBRYOGENESIS AND MORPHOGENESIS 


92 


EMBRYONIC DEVELOPMENT 


93 


CHROMATIN ARCHTIECTURE 


94 


GLUTAMINE AMIDOTRANSFERASE 


95 


METABOLISM OF ENERGY RESERVES 


96 


FRUCTOSE 6-PHOSPHATE METABOLISM 


97 


GLUTAMINE-FRUCTOSE-6-PHOSPHATE TRANSAMINASE 
(ISOMEREING) 


98 


ATPBINDING 


99 


MITOCHONDRIAL MEMBRANE 


100 


ATP-BINDING CASSETTE (ABC) TRANSPORTER 


101 


ATP-BINDING CASSETTE 


102 


RECEPTOR SIGNALLING PROTEIN 


103 


G-PROTEIN SIGNALLING, LINKED TO CGMP NUCLEOTIDE 
SECOND MESSENGER 


104 


PROTEIN KINASE 


105 


PROTEIN SERINE/THREONINE KINASE 


106 


NEUROGENESIS 


107 


NEURONAL DEVELOPMENT 


108 


PROLINE BIOSYNTHESIS 


109 


N-ACETYI^GAMMA-GLUTAMYI^PHOSPHATE REDUCTASE 


110 


OXIDOREDUCTASE 


111 


AMINO-ACID METABOUSM 


112 


SNRNPU5E 


113 


MRNASPUCING 


114 


MRNA PROCESSING 


115 


PRE-MRNA SPUCING FACTOR 


116 


RNA SPLICING 


117 


SPUCEOSOMAL SUBUNTT 


118 


TRANSCRIPTION CO-AdTVATOR 


119 


REPRODUCTION 


120 


CELL-TO-CELL SIGNALLING 


121 


PEROXISOMAL MEMBRANE 


122 


INTEGRAL PEROXISOMAL MEMBRANE 


123 


PEROXISOME ORGANIZATION AND BIOGENESIS 


124 


PEROXISOMAL LONG-CHAIN FATTY ACID IMPORT 


125 


CNS-SPECMC FUNCTIONS 


126 


1KB KINASE 


127 


IMMUNE RESPONSE 


128 


PHOSPHORYLATION OF I-KAPPAB 
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129 


ANTI-PATHOGEN RESPONSE 


130 


ACTIN BUNDLING 


131 


ACTIN CYTOSKELETON 


132 


CELL SHAPE AND CELL SIZE CONTROL 


133 


ACTIN CYTOSKELETON REORGANIZATION 


134 


CELL STRUCTURE 


135 


COMPLEX ASSEMBLY PROTEIN 


136 


Gl/S-SPECTFIC CYCLTN 

VJ X/ OX X^\^XL XV,^ X V,^X.jXJL^ 


137 


REGULATORY SUBUNIT 

IXX^Vxl WX.MX X V./XX X iJ UX' V^l IXX 


138 


APOPTOSIS INHIBrrOR 


139 


HISTONE DEACFTYLASE 






141 


ANTTMirROBTAT HUMORAL RESPONSE 

*TkJ.T XXlrXX^XXVyX^XTlUL^ xX\JlVX\JJ\u\X-t XVL-nJX \^1^\JX^ 


142 


SMALL MOLRCUI E-BTNDTNG PROTEIN 


143 


PHOSPHOMANNOMUTASE 

X XXVj'WX XXVyXTXITLl^JL 1 VyiVXl^ XixkJX^ 


144 

Alt 


PROTETNT GLYCOSYLATION 

X xvvy X x^xi ^ \jx^ X \^\jvj X x^^n. x xv^jl i 


145 


N-LTNKED GLYCOSYLATION 


146 


GDP-MANNOSE BIOSYNTHESIS 


147 


MEMBRANE FRACTION 


148 


BRAIN DEVELOPMENT 

Xi^Xvx^JLl^ X^X^ T 1 '* ■* ' XYXXjlLI X 


149 


PROTEIN BINDING 

k XWyXXJXL^ XJXL^X^XX^XJ 


150 


TRANSCRIPTION FACTOR COMPLEX 


151 

X -/ X 


TR ANSCRIPTTON REGULATION FROM POL 11 PROMOTER 

X XVrVL 1 kJ ^XxXX XXV./!^ XVX^VJ \JX.tr*. X XK^X^ X XvVyXVX X \^X^ XX i. X\.\^±VX\^ XXMt^ 


152 


GAS EXCHANGE 


X 


MirROSOMR 


X J*T 


PLASMA MEMBRANE 

X. Xj(rx\J>i.VXxx IVXl -^ItXIJXXJVLtX^ 


1 55 

x^^ 


VF^in E TARGETING 

V iJ*OXV^JLJ-> X /xJVVJi-i X 11^ VJ 


156 


ER TO GOLGI TRANSPORT 

x^x\. XV-/ vj\-/XwVjx xxv.niiox V./XV X 


1 57 


DY^TROPHTN-A^^OrTATED GLYCOPROTEIN COMPLEX 


158 


FLFCTRON TRANSPORTER 


159 


OUTNOUNATE SYNTHASE 

wxi ^ vyxjtxi^xx xx>« u X 1^ X xxr\\jx^ 


160 

X w 


OTHER METABOLISM 

\^ X XXXJXX ITi ,1 ^ X ;nLXW/X^Xt>JXTX 


161 

XVrX 


CELL MOTILITY 

v-'i i\ t\ ^ xyx\^ X XI /X X X 


162 


INVASIVE GROWTH 


163 


CYTOSKELETAL STRUCTURAL PROTEIN 


164 


PERCEPTION OF PEST/PATHOGEN/PARASITE 

JL JL^iJL W_^JL^A X X J> ^ Vv«^ A X X r X ^ A X JL V^ ■ *i ^/ JL A JUL VJ Ahi/ A X X^ 


165 


DNA RECOMBINATION 


166 


PROTEIN MODIFICATION 


167 


DOUBLE-STRAND BREAK REPAIR 


168 


SPUCEOSOME 


169 


ENDONUCUEASE 


170 


BASE-EXCISION REPAIR 


171 


ENDODEOXYRBONUCUEASE 


172 


URACIL-DNA GLYCOSYLASE 
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172 


DNA-(APURINIC OR APYRMIDINIC SITE) LYASE 


llA 


CENTROMERE 


17f 


iKINETOCHORE ~ 


17€ 


iANTI-APOPTOSIS 


ni 


' CHROMOSOME SEGREGATION 


178 


• NUCLEAR INNER MEMBRANE INTEGRAL PROTEIN 


179 


► CELL DEATH/APOPTOSIS 


18C 


ENERGY PATHWAYS 


181 


GLYCOGEN METABOLISM 


182 


L4-ALPHA-GLUCAN BRANCHING ENZYME 


183 


ENERGY STORAGE 


184 


CELLULAR DEFENSE RESPONSE 


185 


CLASS I MAJOR HISTOCOMPATIBILITY COMPT FY ANTTGFN 


186 


CELL ADHESION 


187 


ADHESIN/AGGLUTININ 


188 


NUCLEOLUS 


189 


NUCLEOPLASM 


190 


POLY-PYREVDDINE TRACT BINDING 


191 


HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN 


192 


DEFENSE RESPONSE 


193 


LYMPHOCYTE ANTIGEN 

■-^ A Al A J. A A^,^ A X X^ X aX ^ X X^^^^X ^ 


194 


VISION 


195 


EXTRACELLULAR MATRIX 


196 


PHOTORECEPnON 


197 


EXTRACELLULAR MATRIX COMPONPNT 


198 


PEROXISOME 


199 


CHROMATIN MODELLING 


200 


CELL-CELL MATRIX ADHESION 


201 


CELL MIGRATION/MOTILrrY 


202 


TRANSLATION ELONGATION FACTOR 


203 


HYDROXYMEIHYLGLUTARYL-COA SYNTHASF 

XXX ^-^ ^— ' -t X. X AT 1 1 / X XX X j^^^x^KJ X jCxXV X X_/ ^^V_/x\ O X X 1 X XxxxOX^ 


204 


DN A METHYL ATION 


205 


DNA (CYTOSINE-5-VMETHYLTRANSFERASE 


206 


STEROL CARRIER 


207 


STEROL TRANSPORTER 


208 


ESTRADIOL 17 BETA-DEHYDROGENASE 


209 


INTEGRAL MEMBRANE PROTEIN 


210 


ETEIANOLAMINEPHOSPHOTRANSEERASE 


211 


[NDUCnON OF APOPTOSIS 


212 


PROTEIN KINASE CASCADE 


213 ( 


:3IAPER0NE 


214 ( 


::ALCIUM BINDING 


215 


PROTEIN SECRETION 


216 


ENDOPLASMIC RETICULUM 
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917 


FMnnPT A ^Tv/nrp pttttpttt ttna vrKiv/rRT? A'WP 

JCrINlJV^rJL./\oiVlJLU ivCflll-'Ui-.UlVl iVJUQlVJLDJK-ftJNJCp 


/.Xo 


PW A PT7P n'MPQ 


910 


PP OTPTM TP A "NTQT HP A TTnXT 


990 




991 


PPT T PVPT P PPriTTT ATOP 


999 


HTsI A "n A TV>r A PP PPCPPlMQP A PTT\/ A TTHXr HP P^'^ 


99^^ 




99A 


PPrPfcPP IV^ A T "HPATT^T r^DA APXTT A "MT^ X/f A TKPTPXT A ISJPP 
jQr JLUiiKMAL Uii V JcJLUrJVLCrJN 1 AJNJJ MAUN 1 xiJN/viN UJtl 


99^ 


UVT\PPkPt7Kr TTD AKrCDO"DTTMP ATT> CVKTTXJA CP 




n I iJKUuJiN-lKAIN or UK 1 ITS Lr 1 WU-oJtlUlUK AlrAoJti 








A Tt> T^riOTTXTTlTTXTT "DXT A TTC7T T/^ A CC 

Air L'xirliJNDliJN 1 KJN A llrSLiUAotS 


990 

zzy 


KcorUINorl lU VxKUoJtio 


Z:5U 


AiN 1 1 V IKAL Kbor UJNb±i rKU 1 JciN 


9Q 1 
Zol 


D A TXJOPPXTTP TXT\7 A CTOXT 

r A 1 xlUloJCliN ICIJN V AMUlN 


Idl 


rJbrllL>b IKAMorUKl 


Zoo 


rJcrliiJii IKAlNorUKlxlK 


Zo4 


PM'7VX/rp APTT\/AT^^p 
cJNZ* I JVlii AUllVAlUK 


9*2 
ZOJ 


KrPi"M CPT PPTTVrP \7PCTPT P TP AXTCPOPT 
INUlN-oJtiLJDUll vxi VliolL^ljli IKAlNorUKl 


ZOO 


TPAMCT ATTPXTPAPTPP 
1 KAIN ol^ 1 lUJN rAUlUK 


Zo / 


PPPkTPTKT CVTvl'rUUCTC PT f^XTP A TTPXT 
rKU i HilN o 1 IN 1 JrUiSolo ilLUJNUAl lUJN 


9QC 
Zoo 


A /'"^nTTM PTT A AyTPXTT 

AL^ 1 ilN rlLAJVllilN 1 


zoy 


PTD A CP A PTTVr A Tr\P 
VolrAoJti AUilVAlUK 


Z4U 


(~V \ '!> A CP TXTXJTO TT^^'D 

vj 1 r AorS IJNJtlLDi 1 UK 


Z41 


A T Ti KCWW TT TXT T>TXTr\TXTO 


Z4z 


DTjD rDTJTTD A T T>T A CA/T A X>rD\>rDT) A XTD "D13 PVTTJTXT 

rilKJLrilnKAL rLAoMA MJblMoKAJNxi rKUliiUN 


z4o 


r^TT> A CC A OTTVr A THTXT/^ T5T5 /^T'CTXT 

UlrAob AUllVAlUNUrKUlliUN 


Z44 


LYoUoUMb 


Z4j 


T vc/^crMVATT r\^r^ a xtt^ a tpt/^xt a XTr\ ■dt/^/^ttivtcctc 
LioUoUMli UKuAJNJZAliUJN AJNU blUUcJNliolo 


Z40 


"DXTA T>t)00"CCCTXT/^ 

KJN A rKULbooUNLr 


Z4/ 


PPPTTTi A CP 

rJtir IJLUAob 


9/1 C 
Z4o 


A P CPXTTTP TD A XTCT5/^15 TPD 

AKoxllNllb IKAINorUKlliK 


9/1 0 
Z47 


KTTTPT PPlPVTPfcDT A CTVyTTP TD AKTCPPPT 
IN UUUdUU 1 1 Ur bAoMlU 1 KAJNor UKI 


9^n 

ZOU 


MTTPTPAP PVrnPT ACA^P TP AXrCPPlPT 
IN U^Ul AK-l^ I 1 Ur JLAoiVlIU I KAiN Dr UK 1 


9^1 
ZOI 


PH TTPnCP P ATA POT TCTV/f 


9^9 


riT YPPP OT % PTTO QPW A TP r»PTTVTiP PlPPM A QP 
vji-f I v-^JCjxUi^o-rxiUorri/\ 1 JC( i iJxvu\jrxiiN/\oii 


ZOO 


MTTTPTP>JT APCnPPTTnM 


254 


CYCLOPHDLIN 


255 


ISOMERASE 


256 


SPECIFIC RNA POLYMERASE E TRANSCRIPTION FACTOR 


257 


PROTEIN FOLDING 


258 


HEUEROCHROMATIN 


259 


VIRULENCE 


260 


H3/H4 fflSTONE ACETYLTRANSFERASE 



261 


METALLOCARBOXYPEPTIDASE 


262 


CASEIN KINASE U 


263 


JAK-STAT CASCADE 


264 


ACUTE-PHASE RESPONSE 


265 


HEMATOPOHTIN/INTERI^ON-CLASS (D200-DOMAIN) 
CiTrOKINE RECEPTOR SIGNAL TRANSDUCER 


266 


SIALYLTRANSFERASE 


267 


AMINOSUGAR METABOLISM 


268 


GLYCOLIPID METABOLISM 


269 


LIPID:PROTEIN MODIFICATION 


270 


DNA TOPOISOMERASE 


271 


DNA TOPOISOMERASE (ATP-HYDROLYZING) 


272 


TOPOISOMERASE 


273 


DNA METABOLISM 


274 


DNA-DIREC 1 liD RNA POLYMEEIASE I 


275 


TRANSCRIPTION FROM POL I PROMOTER 


276 


RNA POLYMERASE I TRANSCRIPTION FACTOR COMPLEX 


277 


POL I TRANSCRIPTION 


278 


T»X.T A TPkrf^T XTTk KT"*T\ A OT? OT TOT TXTTTI 

RNA POLYMERASE SUBUNIT 


279 


TYROSINE RECOMBINASE 


280 


26S PROTEASOME 


281 


19S PROTEASOME REGULATORY PARTICLE 


282 


TVIX ^NrTTT'i'i vT TXT^^TIX A TX A rx 'X /~\ X T 

PROTEIN DEGRADATION 


283 


f-^-l^v y—\r ■ t'f < A #^ M t'^ O X TXX T TX TT^X* 

PROTEASOME SUBUNIT 


284 


ASPARTIC-TYPE ENDOPEPTIDASE 


285 


GUANYLATE CYCLASE 


286 


RECEPTOR GUANYLATE CYCLASE 


287 


« iTi^xy^r ■ 1^ xxx-i>^^^x jtx^xxt a n ixyxx x 

MEIOTIC RECOMBINATION 


288 


* If ir tn^^rxtxy^ xxx^rf^^^x jttxxxt a rx<xy^xT 

MITOTIC RECOMBINATION 


289 


X^ X^ "VTA X^X* y^ yvX^ XX T ^"1 

RRNA PROCESSING 


290 


fSX JT A T T XTTT,^XX T^/*\T A T* TXXT A 

SMALL NUCLEOLAR RNA 


291 


/^f 1 < 1 1 > JT> TXT7X 7T7T ^X'fXX AT?XTT^ 

OTHER DEVELOPMENT 


292 


MALE MEIOSIS 


293 


IkANSCRDPTION FACTOR i^JLlti 


294 


T^T'i A XTOi~tT>TTVX>T/^XT TXTTTT A TT/^XT TTT^ /^X JT T>^T TT TIT>/^X JTi^TTTn 

IkANSCRIPTION XNITIATION FROM FOL 11 FRUMOTER 


295 


GENERAL RNA rOLiMERAob U IRANSLRir IlUN rAL lOR 


296 


ARYLESTERASE 




1 UMUK AlN 1 ILrJtiN 


298 


INFLAMMATORY RESPONSE 


299 


ANTIBACTERIAL HUMORAL RESPONSE 


300 


RESPONSE TO PATHOGENIC BACTERIA 


301 


ATP DEPENDENT DNA HEUCASE 


302 


PROTEIN COMPLEX ASSEMBLY, MULTICHAPERONE 
PATEIWAY 



69 



302 


CARBOHYDRATE METABOUSM 


304 


ISOCITRATE DEHYDROGENASE (NAD+) 


30f 


iGTPASE 


3oe 


> OTP-BINDING PROTEIN/GTPASE 


301 


' RRNA TRANSCRIPTION 


308 


> TRNA TRANSCRIPTION 


30S 


> TRANSCRIPTION FACTOR TPDIC 


31C 


1 TRANSCRIPnON FROM POL HI PROMOTER 


311 


RNA POLYMERASE HI TRANSCRIPTION FACTOR 


312 


POL in TRANSCRIPTION 


313 


INTRACELLULAR SIGNALLING CASCADE 


314 


GOLGI APPARATUS 


315 


ARF GUANYL-NUCLEOTIDE EXCHANGE FACTOR 


316 


GUANINE NUCLEOTIDE EXCHANGE FACTOR 


317 


RNA ELONGATION FROM POL II PROMOTER 


318 


POSITIVE TRANSCRIPTION ELONGATION FACTOR 


319 


MICROTUBULE 


320 


STRUCTURAL PROTEIN 


321 


GUA CELL DIFFERENTIATION 


322 


PHOSPHOLIPID BINDING 


323 


SKELETAL DEYHjOPMENT 


324 


CARTILAGE CONDENSATION 


325 


BONE DEVELOPMENT AND MAINTENANCE 


326 


NUCLEAR PORE 


327 


RAN PROTEIN BINDING 


328 


IMPORTIN, BETA-SUBUNTT 


329 


NLS-BEARING SUBSTRATE-NUCLEUS IMPORT 


330 


NUCLEAR LOCALIZATION SEQUENCE BINDING 


331 


RECEPTOR (PROTEIN TIIANSLOCATION) 


332 


PROTEIN COMPLEX ASSEMBLY 


333 


PROLYL-TRNA BIOSYNTHESIS 


334 


GLUTAMYL-TRNA BIOSYNTHESIS 


335 


PROTEIN ADP-RIBOSYLATION 


336 


CELL GROWTH AND MAINTENANCE 


337 


NAD(+) ADP-RIBOSYLTRANSFERASE 


338 


LARGE RIBOSOMAL SUBUNTT 


339 


STRUCTURAL PROTEIN OF RIBOSOME 


340 


GENERAL CELLULAR ROUE 


341 


RIBOSOMAL SUBUNTT 


342 


MEMBRANE. 


343 ( 


CIRCULATION 


344 


POSITIVE CONTROL OF CELL PROLIFERATION 


345. 


^GIOGENESIS 


346 


HEPARIN N-DEACTYLASE/N-SUIJOTRANSEERASE 
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Dni 


TTTT n\/nPPTH MATNTTPMATSIPF 
1 JtllA^'IVJLCilNjl IVJLnJiN 1 XUL > -rtiN v^H 










jjyj 






TnTTTlFTA PFPPPTOP ^TfrNAT T TNG PATHWAY 






353 


PROTEIN KINASE INHBITOR 


354 


UiiiiMU 1 AAxb 




LTXiObFrlUDiM IbKAbii 1 


ijo 


DXJr^ODXJ ATT? "N/nCTAPP^T TCA/T 


OCT 


[N UCUiU i iUxl Jr 1 KUrxlUorxlA 1 Aoil 


O CO 


rKANoCKLr 1 lUJN r 1 UK JdJLINI>JlJNU 


'3CO 


'D'D /^nCTXT T TNTVCT^ 'D'DnJTyTf^P "DPOTThTXT QTriMAT T TKm 

Lj-JrKU 1 xiUN i JNKKIj KxiCJtir 1 UK r^KU i JiilN olVJiNAI J AisKj 
rAlrlVv A i 




OTTTRP PWriQPMATA 
U X jIqK -TXlUor rlA 1 Aoxi 




nTTANTYT ATPTTTNTAW 
ijrUAiN IJ-jAIE iSJiN/\oX3 


30Z 


OTTfTPP ITTMACIP 
KJ X JICK ISJiNAoJCl 




\yfOTnP 


Dm 


iNUXN-lYlUo^JuC lYl I l^OllN 




N/rnTHP PPHTPIN 


300 


T^pn YVPVTmTMP 'K'TTsT A 
Ut2\Jj^ x\^x XxUxnXjt JSJi^lrVOll 


30/ 


DVPT^/rmTMP MTTPT PnTTTlP X/TPTAPnT T^M 


30o 




3oy 


T TDTn TP AXTQPPiPT 


3/U 


riJiAKJJNLj 


3/1 


nrr T r^vr'TP apppqt 


51 L 


\/rwr\TTr^ ni /c tp Axrcmoxr 
iVlilUllU Lrl/o IKAJNolllUJN 


3/3 


rNrnTTPTrnM np aphptpiqiq pv ttnttp appt t ttt ap ^thmat ^ 


3/4 


OP rvrPTXT PtrnQppf a t a qp i PTMnTKro 


3/j 


DP OTPTM VTM A QP A A MPPrnP TMf^ PP OTPTNT 


3/0 


AKTPPrPlP PPnTPTM 


Dl 1 


LAJriXJ JDXlSxJXxSKJ 


J to 


APTTVATTON OF A/rAPT<r 


3/y 


n PPPlTPTM T TMTTPn PPPPPTHP 


3oU 


pprn <;PTTnp yt a ^F tttm a ^f 




TR AN<?PRTPTTON TFRMTN ATTON FROM POL TI PROMOTER 

X JV/VL T 0\-^XNJi X X X XJrXvLVXXl^/T X X\-/l l X^XvviVX ± \-/X^ XX X JWJiJXK^ X JutlX 


382 


CENTRAL NERVOUS SYSTEM DEVELOPMENT 


383 


OXIDATIVE STRESS RESPONSE 


384 


CELL STRESS 


385 


RECEPTOR 


386 


CELL SURFACE RECEPTOR LINKED SIGNAL TRANSDUCTION 


387 


UGAND 


388 


APOPTOSB 
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PThT T PPT T QTnXTATTTNTn 


oyu 


TTP A T ^WnPTT PP nTCTKT 
nJDrV i oXlUV^JV rivw 1 JiliN 


oyx 


TPATSJQT ATinKTAT PTJnTTT ATTmsT TXTTTTATTmsj 




MPM A PT P A V A 


Dyj 


MPMA POT V A "nT7"MVT ATTHM 
iVJisiV A rKJLi i AJJxiJN 1 1-A 1 lUIN 




PTTHT TMPCTThP a CP 


jyo 


LAJVlilNilN KiiClirlUK 


jyo 


LAMIJNIJN KCrUiir lUK rKUlliirS 


jy / 




jy© 


rAl 1 1 ACID UlloAl UKAllUJN 


jyy 


tXjr KbLlir 1 UK DUWJN KbCjULA IIUIM 


4UU 


x jrrr^Tt rWJ TOT TT "D XTT T^T "C A TT/^XT 


4U1 


MlCKUl UliULD AooUClAlriL' rKUlliliN 


/I no 


TXTTT7T5T T7T TVIKT O "D "D ^"^'CXyT'/^'D 

UN 1 JbU<LJiUJsJLIN-2 KbUlir 1 UK 


4U3 


UN lllKl-bUJsJLiN-4 KxlCrSJr 1 UK 


4U4 


UNlliKLliUJsJJN-/ KbCiirlUK 


4UD 


UN llivjKLlN 


4UD 


P^^T T A PPM PTNiriTMP 


4U/ 


PT POT^ PPAPTTT ATTPXr 

dLUKJu UUAUULA 1 lUJN 


4Uo 


PPT T ATMJPCTPXT PPPPDTPP 

L,liLL ADJiliolUJN KcCxir 1 UK 


4uy 


■pTCTPPTTMPCTC ATVJTi PPPANTPPPXTPQTQ 

rUo 1 UUiiiNliolo AJNL? UKijrAJNULrlilNilolo 


41U 


PT r\C\T\ PT P'IM'INTP 

i>L\J\J\J CLU 1 lUNvj 


411 


IKAiNbMJiMbKAJNll KcCiirlUK rKUlxiUN 1 iKUollMli KUNAoli 


41Z 


P"vnrpcpT TP T Appp PTPPCPTv/r AT cttpttkttt 
U 1 1 UoULlC LAKljli KLdUoUMAL (oUo )-oU1> UJNl 1 


413 


UJLAbb 11 MAJ UK xllo 1 UUUMr A 1 IrSlLl 1 Y CUMPLcA AN 1 lunN 


414 


A Try a Tncrri A tt^^xt r^u a xtxtct 
Air-VjAlllLi CAIIUIN UrlANiNiiL 


y| 1 C 

415 


UUlN AD Db VliLUrMliJN 1 


41o 


UbKM LnLL MIUKAIIUIn 


417 


CHOUESTEROL METABOLISM 


418 


CHOLESTEROL BIOSYNTHESIS 


419 


/^T7T> A JTf TXTT7 H ^ A TXTTTJXT A XT/^^T? 

(j^bRMJ^IJNE MAlTNlBNANCb 


421) 


uULvri Ulb-r ALii 


421 


T\'VXT A A/TTXT OTTl A CD 

DYJNAJVllJN UlrAbll 


422 


/^"DT T /^rMl VTA /TT TXTT/^ A TT/^XT 

CKI J > CUMM U JNIC A 1 lUJN 




A/rrrppTTPXTTiP tat AyrpK/TDP a xtp op n a xttz a ttoxt a xtt^ 
IVJilUUxlUJNDKlAJL JMUolVlii UKL1AJNIZ.AIIUIN AJND 

RTnnPMPCTC 






425 


TNOSTroL/PHOSPHATIDYLINOSrrOL XTNASE 


426 


PROIEIN-PEROXISOME TARGETING 


427 


Peroxisome targeting signal-i receptor 


428 


RAS GTPASE activator 


429 


CELLCYCUE 


430 


JNK CASCADE 


431 


inactivation of mark 
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MbbUDxiKM Db VELOPMbN 1 




INDUCTION Or APOPTOSIS BY E?GlCACliLLuLAR SIGN ALo 


434 


MISMATCH REPAIR 


43j 


DNA REPAIR ENZYME 


436 


DNA REPAIR PROTEIN 


437 


EII&ARYOTIC TRANSLATION INITIATION FACTOR 2 
COMPLEX 


y4 O O 

438 


EXTRACEIXUlAR SPACE 


439 


BLOOD COAGULATION FACTOR IX 


440 


PROTEIN PHOSPHATASE TYPE 1 CATALYST 


441 


REGULATION OF G-PROTEIN LINKED RECEPTOR PROTEIN 
SIGNALLING PATHWAY 


442 


METHYL TRANSFERASE 


443 


3 (2 ),5 -BISPHOSPHATE NUCLEOTIDASE 


AAA 

444 


NUCLEOBASE, NUCLEOSIDE, NUCLEOTIDE AND NUCLEIC 
ACID METABOLISM 


/I /I c 

445 


CY lULliKOME-C OXlDAbB 


A AH 

44o 


T T7 A T) KTTIVT/^ 


/I /IT 

447 


rEcDlNG BEHAVIOR 


/I yf O 

44o 


r>T> r^TLJ TXT T'vrn /^oTxrn? tt'TXT a ct7 

FKUlblN 1 YROblNE KIN ASJi 


44y 


MblilLUNlNE— IKNA LlLrAoii 


450 


ACllN MUDULAllNu 


451 


NUCLhUllDh-EALlolON RxlrAlR 


452 


SINGl^-STRANDED DNA BINDING 


A C5 

453 


TTVTT^ A /~»'C7T TTTT ATkX^TA TT> TV CTTJ T T/"V-nT Tn A T WTi /^'prjTXT 

EXTRACELLULAR MATRIX STRUCTURAL PROTEIN 


454 


A T^TTXT/^ OTXIT? TN"C7 A X jITXT A PT? 

ADENOSINE DEAMINASE 


A CC 

455 


A TNT7XT/^ OTKTC? TNT? A jTTVT A OT7 TiT? A /"vnT^XT 

ADENOSINE DEAMINASE REACTION 


456 


RAS PROTEIN SIGNAL TRANSDUCTION 


457 


RAL GU ANYL-NUCLEOTIDE EXCHANGE FACTOR 


458 


ox if A T T 1 »|> A OT? TlTn/^TTT A np/^'oxr /rx ir lit ? A /"VTTXTrf^ TiTl /^TflTKT 

SMALL GTPASE REGULATORY/INTERACTING PROTEIN 


A C(\ 

459 


COAIED VEbiCLc 


450 


bECKblUKY VEolCLb 


40 1 


ViiolCiJi IKANoPUKl 


4dZ 


ViioW^Lil i-UA i rKU 1 JtiliN 


403 


muxl-DJtilNoll I JJJrUrKUlJtlJLIN 


404 


TTvr 1 'U'P f^TJT T TTT AT? TT? AXTC!T>r\'DT 

IN lliKCxiLi-UJLAK IKAJNorUKl 


405 


DNA DAMALrii K±ioJrUJN oil 






467 


INTRACELLULAR PROTEIN TRAFFIC 


468 


t>ROTEIN DEPHOSPHORYLATION 


469 


PROTEIN TYROSINE PHOSPHATASE 


470 


RIBOSOME BIOGENESIS 


471 


UBIQUmN UGASE COMPLEX 


472 


UBIQUTDN CONJUGATING ENZYME 
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UBIQUii IN-DEPENDENT PROTEIN DEGRADATION 


4/4 


PROTEIN CONJUGATION FACTOR 


Anc 
475 


CHAPERONIN ATPASE 


Alt 


NUCLEIC ACID BINDING 


Ann 

477 


HEAT SHOCK RESPONSE 


478 


NADPH:QUINON0E REDUCTASE 


Ann 

479 


TITT^N OT^TTrf^^^T xr/^r ' I% A >'IM^ T^TTXT A OT^ 

PHOSPHOGLYCERATE KINASE 


480 


FK506 BINDING 


481 


MITOCHONDRIAL MATRIX 


482 


ELECTRON TRANSFER FLA VOPROTEIN 


483 


PROTEIN PHOSPHATASE TYPE 1 


484 


MITOTIC CHECKPOINT 


485 


ANAPHASE-PROMOTING COMPLEX 


486 


SIGNAL RECOGNITION PARTICLE 


487 


DIACYLGLYCEROL KINASE 


488 


PHOSPHOUPASE C ACTIVATION 


489 


CYTOSTOLIC CALCIUM ION CONCENTRATION ELEVATION 


490 


PHOSPHORIBOSYLGLYCINAMIDE FORMYLTRANSFERASE 


491 


TRANSLATION INTTIATION FACTOR 


492 


EUKARYOTIC TRANSLATION INITIATION FACTOR 3 
COMPLEX 


493 


RAN GTPASE ACTIVATOR 


494 


SIGNAL SEQUENCE RECEPTOR 


495 


CO-TRANSLATIONAL MEMBRANE TARGETING 


496 


DEOXYRIBONUCLEOSIDE MONOPHOSPHATE BIOSYNTHESIS 


497 


IMPORTIN, ALPHA-SUBUNTT 


498 


REGULATION OF DNA RECOMBINATION 


499 


NUCLEAR IMPORT/EXPORT PROTEIN 


500 


SPINDLE 


501 


CENTROSOME 


502 


CYTOKINESIS 


503 


SPINDLE POLE BODY 


504 


POLYSOME 


505 


i^OTIC SPINDLE CHECKPOINT 


5Uo 


CARBAMOYL-PHOSPHATE SYNTHASE (GLUTAMINE- 
ilYDROLYZlNG) 


DU7 


TN"nT TO TOT 1 rfvnr attoxt 
DbUiJlCjlUlI YLAnUN 


DVo 


U I o 1 xlJLLNxl- 1 1 Jril JcJNlJwr^Jir 1 11/ Aoii 


509 


UBIQUmN-SPECMC PROTEASE 


510 


ENDOCYTOSK 


511 


RAB GIPASE ACTIVATOR 


512 


INSULIN RECEPTOR SIGNALLING PATHWAY 


513 


RNAHEUCASE 


514 


LYSINE-TRNA LIGASE 
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515 


InUCLEOSOME ASSEMBLY 


516 


• CHROMATIN ASSEMBLY COMPLEX 


511 


CALCIUM ION TRANSPORTER 


518 


INOSITOL-l,4,5-TR]PHOSPHATE RECEPTOR 


519 


DNA-DIRECTED RNA POLYMERASE H 


52C 


ASPARTATE CATABOUSM 


521 


CYTOCHROME P450 


522 


-EYE MORPHOGENESIS 


523 


EXOCYTOSIS 


524 


SNAP RECEPTOR 


525 


MEMBRANE FUSION 


526 


NON-SH^CnVE VESICLE TARGETING 


527 


DOCKING PROTEIN 


528 


PROTEIN TARGETING 


529 


REGULATION OF CDK ACTTVITY 


530 


EUKARYOTIC TRANSLATION INITIATION FACTOR 4 
COMPLEX 


531 


SNRNPU2E 


532 


SNRNPUIE 


533 


SMAIX NUCLEAR RIBONUCLEOPROTEIN 


534 


PROTEIN LOCALIZATION 


535 


SERPIN 

"X^XXX XJk 1 


536 


ENZYME INHIBrrOR 

* 41 ^JLj X. XT 1,1 Xi. ^X 1 .1 1 JXx V/xV 


537 


N-METHYLTRANSFERASE 


538 


N-TERMINAL PROTEIN MFTHYT ATTON 


539 


IMP CYCLOHYDROT ASF 


540 


FORMYLTRANSFERASE 

X V^XXXTX X J—t X X XX 1 X^XVX^ta^X^ 


541 


MITOTIC G2 PHASE 

Jk'XXX X XVa' X^Atf X XXf^lkifX^ 


542 


SPINDLE POLE BODY AND MICROTlJBULE CYCLE rSENSU 
SACCHAROMYCES) 


543 


GMP SYNTHASE 


544 


PURINE BASE BIOSYNTHESIS 


545 


PNA DEPENDENT DNA REPLICATION 


546 


DNA REPUCATION FACTOR A COMPLEX 


547 


NUCLEOTIDE BINDING 


548 


DNA REPUCATION CHECKPOINT 


549 


DNA REPUCATION INHIBITION 


550 


MITOTIC START CONTROL POINT 


551 


TEMPERATURE RESPONSE 


552 


TRANSCRIPTION 


553 


RECEPTOR SIGNALLING PROTEIN TYROSINE KINASE 


554 


DAMAGED DNA BINDING 


555 


PYRIMIDINE-DIMER REPAIR, DNA DAMAGE EXCISION 
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DDV 


VTP A T PThPT TP ATTHM 


557 


PROTEIN C-TERMINUS BINDING 




X>X> OTT?TXr PTJOCDTJ ATA CP? TVPT? 0 A 




DD OTTJTKT TJUOCDTJ ATA CT3 TVPT7 0 A P P/^T TT A TOP 

rKU 1 JtlJJN r JdUorxlA 1 Aoxi 1 1 rii ZA KJtiUUJL.A 1 UK 




MT TOT X? A P "N/rCTN/rDP A VTC 

IN UULilAK MbMxsKAJNii 




MAJLA 1 Ji MJti 1 AJt5UJLlo JVl 




TTDTP AT>Pr\WT TP A PTFi PVPT TT 

1 KiL AKi5U A i JLlU AUiU C i UJLti 


J 03 


TVyr ATA TT3 r\TJXJVT\P OnXTM A CT3 ffW AT O A PT7T A TP 

MALA 1 Jb DJitl 1 L/KULrJtilN Aosl (^UA ALU AUJG i A I Ji 
UliCAKDUA X JLA 1 JUN Lr^ ^IN A-Ur+; 




MblUolo 




QT>T70TVyr ATOl^TTNTPCTC 

o Jt^JiKJVlA i UUeJN Jio io 


DDO 


olo IrSK UxlKUMA 1 nJ UUJtlJbolUiN 


DO/ 






rTJPkT P CTPP r\T P A T A Tir\T TQA^" 


Doy 


CTPPPTH TTOPTV/TOXTR PPPPPTOP 


J l\J 


TD TDTrtynr^ VT "Dporm A ct? tt 
1 KJLriir 1 UJ i i-rrlir 1 JLU Aoc JUL 


D /I 


r\ A T A MVI Tl A T A XTTMTh PXTHPlPPP' 1 ' 1 1 A QP 
U-ALAJN I Li-J^-AJLAJNITN Ji iilNJJUJriir llU/Vi^li 


3 /Z 


DXJOCPXrrM TP A CP P 


J Id 


DUTi QPXJPlT TPTn A/TRTAPni TCA/T 


J /4 


'TP AMCPPTDTTOM POTTPT PTi PPPATP 


J /j 


A PTT\7" A TTOXr OP TT TM TTTM A QP 
AUllVAilUXN Up JUiN JSJLiNAoii 




A T^P'^TVT A TP VTXT A CP 

AJDJiJN YIjAI Ji JsJUN AoJi 


J 1 1 


KyrrPPOTTTDTTT P PTTsJnTKTO 




TvyrFTPkTTP m/TVA" TP AMCTTTOXT 
MllUllU VjZ/M IKAJNMiiUJN 


D/y 


o 1 KJiob KiibrUiNoii 




r^r\xTTD r\x r\xs xrc apt 
UUJN 1 KUJL Ur xUiAKl 


CO 1 


■\yrTTC/^T "C /^rWPTD A i^TTPtVI 

M U ^LJLb CUJN 1 i<AU 1 lUJN 


coo 


CTTDTTl^^^TTTD A T DtDr^TCTXT /^P "N^TTCPT P 


coo 


MUbLLb AUllUPs 


CO/I 


lir JL>liKMAL DJUrrliJ<JbiN 1 JLA 1 iUiN 




PTT T\7T7PTlTTvT PTTHTTPTA CP 


DoO 


A/TAP T^TNTA CP 
MAir JSJlNAoJi 


Do / 


XnTAAyTTNT 'NAPT ATIOT TCNA 


Doo 


PTTTTNJOTn Y PPPPPTOP 


Doy 


PPTTMOTP A PTH PPPPPTOP 


jyu 


TPOM TTOMPO^TA ^T^ 




PPNTPTOT P 


592 


CENTROSOME CYCLE 


593 


TRANSCRIPTION ELONGATION FACTOR 


594 


RNA DEPENDENT ADENOSINETRIPHOSPHATASE 


595 


ISOPRENOID BIOSYNTHESIS 


596 


ISOPENTENYL-DIPHOSPHATE DELTA-ISOMERASE 


597 


TRANSCRIPTION REGULATION FROM POL HI PROMOTER 


598 


CYCUN-DEPENDENT PROTEIN KINASE 
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599 


^SCLE MYOSIN 


60C 


IPH0SPH0LIPASEA2 


601 


MITOTIC CHROMOSOME SEGREGATION 


602 


. OLIGOSACCHARYL TRANSFERASE 


603 


MYOSIN ATPASE 


604 


OTP BINDING 


605 


CELL SHAPE CONTROL 


606 


SMALL GTPASE MEDIATED SIGNAL TRANSDUCTION 


607 


HYDROGEN-TRANSPORTING ATP SYNTHASE CATALYTIC 
CORE 


608 


MUSCLE DEVELOPMENT 


609 


PREGNANCY 


610 


UBIQUmN-PROTEIN LIGASE 


, 611 


ACYLTRANSFERASE 


612 


FATTY ACID METABOLISM 


613 


GUANYLATE CYCLASE, SOLUBLE 


614 


NO MEDIATED SIGNAL TRANSDUCTION 


615 


FATTY ACE) CATABOUSM 


616 


PROPIONYL-COA CARBOXYLASE 


617 


PROTEIN ACETYLATION 


618 


NUCLEOSOME REMODELLING COMPLEX 


619 


CASPASE-2 


620 


APOPTOTIC PROGRAM 


621 


7-ALPHA-HYDROXYSTEROID DEHYDROGENASE 


622 


INOSITOL-l,4,5-TRIPHOSPHATE5-PHOSPHATASE 


623 


UGAND-DEPENDENT NUCLEAR RECEPTOR 


624 


ACITN BINDING 


625 


POST GOLGI TRANSPORT 


626 


PYRIDOXAL KINASE 


627 


LAMIN 


628 


NUCLEAR LAMINA 


629 


SERINE PROTEASE INHffilTOR 


630 


STEAROYL-COA 9-DESATURASE 


631 


HEPATOCYTE GROWTH FACTOR RECEPTOR 


632 


BILE ACID BIOSYNTHESIS 


633 


OX YSTEROL 7-ALPHA-HYDROXYLASE 


634 


POSTERIOR MIDGUT DEVELOPMENT 


635 


EXTRACEULULAR 


636 


MQCROTUBULE STABILIZATION 


637 


EGF RECEPTOR SIGNALLING PATHWAY 


638 


NOBUROTROPHIN TRKA RECEPTOR 


639 


IRANSMEMBRANE RECEPTOR PROTEIN TYROSINE KINASE 
SIGNALLING PATHWAY 


640 


METALLOPEPTIDASE 
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641 


MTTOCHONDRIAL TRANSLOCATION 


642 


MITOCHONDRIAL INNER MEMBRANE TRANSLOCASE 
-COMPUEX 


643 


FRUCTOSE METABOUSM 


644 


CYTOPLASMIC DYNEIN 


645 


ARP2/3 PROTEIN COMPLEX 


646 


CELL ELONGATION 


647 


NADH DEHYDROGENASE (UBIQUINONE) 


648 


GAMETOGENESIS 


649 


MEIOTIC CHROMOSOME 


650 


DNA DAMAGE CHECKPOINT 


651 


MITOTIC CHROMOSOME CONDENSATION 


652 


DNA REPLICATION AND CHROMOSOME CYCLE 


653 


CAMP-DEPENDENT PROTEIN KINASE REGULATOR 


654 


EUKARYOTIC TRANSLATION ELONGATION FACTOR 1 


655 


GOLGI MEMBRANE 


656 


MANNOSE BINDING LECTIN 


657 


PHENYLALANINE-TRNA UGASE 


658 


PHENYLALANYL-TRNA BIOSYNTHESIS 


659 


LIGAND BINDING OR CARRIER 


660 


ELECTRON DONOR 


661 


ACYL-COA OXIDASE 


662 


CELL AGEING 


663 


DNA-DIRECTED RNA POLYMERASE m 


664 


TRANSCRIPTION REGULATION FROM POL I PROMOTER 


665 


RIBOSOME 


666 


SIGNAL RECOGNITION PARTICLE RECEPTOR 


667 


LONG-CHAIN-FATTY-ACID-COA-LIGASE 


668 


MONOOXYGENASE 


669 


TRANSLATIONAL ATTENUATION 


670 


TROPOMYOSIN BINDING 


671 


ACTIN CAPPING PROTECT 


672 


CHROMATIN 


673 


PROTEIN-NUCLEUS IMPORT 


674 


LAMININ-5 


675 


DEFENSE/IMMUNITY PROTEIN 


676 


LANOSTEROL 14-ALPHA-DEMETHYLASE 


677 


SH3/SH2 ADAPTOR PROTEIN 


678 


RHO PROTEIN SIGNAL TRANSDUCTION 


679 


kCm FILAMENT SEVERING 


680 


ACTIN POLYMERIZATION/DEPOLYMERIZATION 


681 


RAB GDP-DISSOCIATION 1NHIB]T0R 


682: 


XENOBIOTIC METABOLISM 


683 


DETOXIFICATION RESPONSE 



78 
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P VTOPTTP CWAU "R < P "RTM TPT A 

\^ I 1 \JV-^JQJV\JiYUl 15 J KJDlJUV^l A^Ii 


Oo3 


KTTTPTP nVTHT? "DTOCVKI' \ 'Ul U' QTQ 


DoQ 




Do / 


C VTvT A PTTP TP A MCN/TTQ QTHM r'WOT T?srpP HTP 


ACQ 


T AX/fTKT PTxrnTMn 
JLAiVUJN JiJLINL'JJNVJ 




T A TV/rTNT/PTTD C\\f[ A TTM PTMHTMri 




AM X XJJUJ rjsXJ 1 JtiliN 


. 691 


MRNA BINDING 


by2 


CjDP-DIoSOCIATIOJN INHiDllUK 




Tk. >| 1 ' LTT7XTVT m^'TO A TTVT\T> /^"C/^T A '"I'C? /^'V/^T /^XTVrr\t> /^T A CC 

MKiiinN YLlblRAJniDKUrULAlb C iLLUHiDKULAoil 


594 


> yi 1 i^T T\7T T"7X lTyT^T?nPTl A T T\7T\T> ^"D/^T ATT? T^T?TT\/T\T1 /^OTTXT A CD 

METHYI^NbrjbTKAHYJDKUrUi^ DJEoYDKUuJciNAbJbS 


695 


S ATElJLrrE DNA BINDING 


696 


LIPID PARTICUE 


09/ 


XT/*^XT X JTTTfc >!T»Tr» A XTT? OT> A XTXTTXT/^ T>T> ^TT^TXT IPVT* /^PT\TT3 

NON-MEMBRANE SPANNING PROTEIN TYROSINE 

I>TJO CPU ATA CT3 

rxlUorrlA I AoH 


d9o 


o UriiKUAiUE ME i Ad UEioM 


099 


1 L|' 1 L' <A1 T^T40YVXrVT>TTCTKTn CVXTTTJACTh 

[lilr-!)AJ-DJb.UAlilirUoJJNrl- o i JNiJHLAoJa 


/UU 


POAyTDT "CY T /ATAT^tJ TO T TPTOT m^JO'^^C^ 
UUiVLr l jr,?L 1 (INAUxl lU UiJiyUJLiNUJNJti^ 


/Ul 


JVL IrxlAoii 


/U/ 


PVTPl C T3T A T PP OTCTM PTXTTfcTMn PP OTCTKT 


i\)D 


DTJOCPWOT TP A CP A 1 
rnUorilUlJLr AoJCi Al 


n(\A 


rxlUoPxlA 1 iU I LocKlINii Mil 1 AiSUJLloM 


7n< 
/U!) 




/Uo 


orEKMAliL) JUtViiLAJrMJcJN 1 


/U/ 


UJNA KilrLlL.AIiUN UKlUilN xJ UN DING 


7Uo 


DJNA KbirLIGAIlUN rAGIUK 


/U9 


DNA KJirLiGAllUJN rALlUK G GUMrUbA 


/lU 


MlIU iiG Gl rrlAoE 


/II 


irLlKAGlGULNE IKANorUKIiiK 


Tl O 

/12 


A r^TTXTT? TP A XTCPOPTPP CTTOOMT* A P V 

AGIiVE IKANorUKlEK, oxiUUlNDAKY 


1 Id 


\ 4" A Tr^P PA PTT TT ATOP CTTPTTPnAAATT V 


/14 


PTTPTTSJCT MTir'T ThO'I'IT'^'R PTOQVTvI' 1 'U U'CTQ 

rUKJJNJti JNUUxJiUliUJc. DlUo x IN IJnJCfOio 




AAyrmoPTxriCPTTnPTRriQVT tp amcttrp a cd 


71 A 
/lO 


PPOTT^fM 'MTTPT T7TTQ TA/TPOPT TP AXTCT OP ATTOXT 


717 

/ J. / 


TMTPnP AT PT AQIV/TA IWrPlVyTRP AKTP PPnTROnT VP AM 


71ft 


nMA QTR A ?<rn PT rjNn A TTmsT 


719 


TRANSKETOLASB 


790 




721 


IRON TRANSPORT 


722 


TRANSFERRIN RECEPTOR 


723 


BLOOD PRESSURE REGULATION 


724 


Htfi'KRO'l'RiMERlU U-PROi'HlN G'l'PASE, BbTA SUBUNl'l' 


725 


ACETYL CHOLINE RECEPTOR SIGNALLING, MUSCARINIC 
PATHWAY 
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726 


PROTON TRANSPORT 


727 


VACUOLAR HYDROGEN-TRANSPORTING ATPASE 


728 


SODIUM/POTASSIUM-EXCHANGING ATPASE 


729 


SODIIJM/POTASSIUM--TRANSPORTING ATPASE 


730 


ANION TRANSPORT 


731 


MITOCHONDRIAL OUTER MEMBRANE 


732 


VOLTAGE-DEPENDENT ANION CHANNEL PORIN 


733 


APOPTOGENTC CYTOCHROME C RELEASE CHANNEL 


734 


ADENINE TRANSPORT 


735 


DOUBT H-STR ANDFD DN A BINDING 


736 


CALCTUM-TRANSPORTTNG ATPASE 




GLYCOPROTEIN DEGRADATION 


738 


HYALURONOGLUCOSAMINIDASE 

XX X i^x.^wxxwx^ v'vJx^w^^v^kj.rvLTXxx^xju'inLLjx^ 


73Q 


EXTRACFLLULAR MATRIX MAINTENANCE 


740 


SERINE CARBOXYPEPTIDASE 

tJJL^XxXl iX^ V^XXLxXJVyxV X X .1 iX X .1.1 y x^\JX^ 


741 


ION CHANNEL 

Xv^l^ K^xxr\x\L\x^x^ 


742 


ION TRANSPORTER 

jlXh/jl^ xxvxiLiikjx vyxxxxjixv 


743 


SENSORY PERCEPTION 


744 


PAIN SENSATION 


745 


THERMORECEPnON AND RESPONSE 


746 


CYTOSOLIC RIBOSOME 


747 


L-LACTATE DEHYDROGFNASF 

L^XoTW^ X X».X X^ X^XjXX X XyXX\_/\JXJi>^jrxOX^ 


748 


HETEROTRIMERIC G-PROTETN GTPASE GAMMA STIRtlMIT 

X 1 1 * X X Jix yy x x\JLXvxj»f i vi^^ vj xxxv^xx^xx^ \_fxx i^ux^j vjmvxxvxxx ovyxjwx^xx 


749 


RAB SMALT MONOMFRTP GTPASF 

xvrvu oivxfii J xvxwx ^ \^xvxxjxvx\^ vj x x i^ox> 


750 


RNA POT YMFT? A T TR AN^PPTPTTON FAPTOR 


751 


P-5 ^TFROT DF.'^ATTTRA^F 


759 


P ATA ROT T*sM 


75'^ 


PARBOXYI ESTFR ASF 


754. 


ADDIPTTON 


755 


VTTAMTNB12 TRAN*^PORT * 


756 


PHYSIOLOGICAL PROCESSES 


757 


VTTAA/rnsr t^to^yntttf^t^ 

Y X X rVlYXXi^ XJX\JO X JL^ X XxCOXO 


75R 


PAT PTTTM TON TTOMFO^TA^T^ 
\^r\u^x\jiyx iKjiy xx\jjyxxjf\JO x rvoxo 


759 


PAIjCTDTOI l-MONOOYYGENA^F 

V^^rvL^XXyXVyx^ x~xVx\Jxy\J\Jj\. X \J£Ui^X\\Jx2t 


760 


OOGENESIS 


761 


PYSTEINE-TYPE PEPTIDASE 

X VJ X J./I.l tIX^ XXX X^ X XJfl X XJ y J^fcJX^ 


762 


G-PROIEIN COUPLED RECEPTOR PROTEIN SIGNALING 
PATHWAY 


763 


PROTEIN TYROSINE/TEIREONINE PHOSPHATASE 


764 


DYNACITN COMPLEX 


765 


INTEGRINUGAM) 


766 


INTEGRA RECEPTOR SIGNAL SIGNALLING PATHWAY 


767 


NEGATIVE REGULATION OF HOMEOTIC GENE (POLYCOMB 
GROUP) 
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769 


SPECTRIN 


770 


OTTT T T~\T7 A rrvfT 

LbULDbAlH 


771 


1 /I .'11 T\ T''l' 

FERRITIN 


772 


JRON BINDING 


773 


PEPTIDYLPROLYL ISOMERASli 


774 


MICROTUBULE CYTOSKELETON 


775 


GAMMA- AMINOBUTYRIC ACID-INHIBrTED CHLORIDE 
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1108 


EXTRACELLULAR MATOIX GLYCOPROTEIN 


1109 


fflGH DENSITY LIPOPROTEIN BINDING 
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1 1 1C 


PAT rrrnvyr pwammttt 


nil 

1 JL JL 1 




1112 


HOMOPHILIC CELL ADHESION 


xxlo 


UAlX.lUM-lINlJilrrlJNL)iijN 1 LiiLJL-LliLJL MAIKIA 


1 1 1/4 

xim 


JCrJVJx) Jv I UJN lU r Ul^AKl 1 I 


1115 


MYO-INOSITOL:SODIUM SYMPORTER 


1 1 1 


rSiirlAVlUK 


1117 


CAIflMABINOID RECEPTOR 


1 1 1 o 

lllo 


G-PROTEIN SIGNALLING, LINKED TO CYCLIC NUCLEOTIDE 
bJciCUJND MlibbbNCjliR 


1 1 1 n 

my 


llLrill JUJNCllUN 


1 1 on 

xxZv 


MEMBRANE- ASSOCIATED PROTEIN WITH GUANYLATE 
isJJNAoli ACllVll I 


1 1 oi 


ppT?Tr^Pxm>Tr\T a t> a Tma tat 
rxlKli^JtllN 1 KJULAK MA 1 ERIAL 


1 1 00 
xiZZ 


PT^P AT A TMJP OTr\XT VTXT A OT7 

rUUAJL ALJxlciolUJN ivUN Aoii 


XX Ad 


c Tn\r A T f^cw/roi p y pop a >r a ttoxt 


1 1 0A 


TTPPA r^r'T P 


110^ 


A P ^TWTXrOCT Tr^r^TM A TP T V A CP 
AKVJllNliN Uo U l^l^UN A 1 ri i-r I Aoxl 


IIZO 


MTTPTPOTTnP CTTnAP A^TRT' APr\T TCA/T 

IN Ui^i-JiU i iUii-o U OAK MJti 1 AJtJUULo M 


1 107 


A/TRTiTATHP rTkAypPT PY 


IlZo 


PATTV APVT nOA QVKPTtTACP 
r A 111 - AL^ 1 L<-UL/ A o I IN 1 HAoii 


1 1 0O 
xxZy 


TP A VTC TDTTr^XT TT7D A JTVM A TT/^M 

IKAJNoUKJLrllUJN lliKMlJNAllUiN 


XXD\) 


TP AKTCOPTPTTOAT PT ATTTOAT PA r^r\X> O/^A/TDT "CV 

1 KAJN o uKir 1 lUIM liL-UPv U A 1 lUJN r AU 1 UK LUMP LJiA 


XXDl 


X>r\f TT TP A ATC01>TPTT/^AT TTCO AyTTAT A TTr^XT 13 A f^niy 

rULi 11 1 K AJN oCKir 1 lUiN 1 hKMlJN A 1 lUJN r AC 1 UK 


1 1 '50 


A CP A CP 1 


lljo 


rULrHAUr-KlbUbn; ULYCUHYDKULAoE 


lla4 


\7'OT T* A #nP CPATCTTTV7P OAT r^Tt TKA /^XJ A TVTVTTTr 

V Ul^ 1 AlaJl-MilN I> 1 1 1 V±i C ALLl UM CHANNEL 


XXDJ 


TP XT A /^TT A XTVT VT TTD A KTCT7t:?T> A CX3 

IKJNA UUAJN ll^iLilKAI^brEKAtSE 


1 1 

11^0 


fU VOOC A A^TMOOT V/^ A XT "DTXTT^TXTO 

LrL^iLUoAMliNUU-LYCAJN blJNDUNu 


XXD 1 


PP OTPTXT CT7P TXTC /TTTD POXTTXTD OTJOCTJTJ A T* A CT3 

rKU 1 JilJN oJDKilNrl/ 1 HKJiUJNlINE r rlUoPnA 1 AoE 


1138 


RHO GTPASE ACTIVATOR 


1 1 "ao 

lloy 


L I lU5>KiiLErON OKuANlZATION AND BIOGENESIS 


1 1 AC\ 

114U 


T^XT A T TO A TTOXT 

L>iN A LrlU A 1 lUJN 


ll*f 1 


AT "nPTTVT^P A4PTAPOT TCA>r 
AlJL'llrl I JJii Mo 1 Ai5UU.oM 


11/10 


AT T\C\ ITTTTO PTTTMTrn^ A CP 
ALJJVJ-lSJiiU KliUULlAoii 


1 14*5 


AT r^TTTA^ T^PPPXTTlPXTT T>TJOCT>*CJOT TDTTx T)TKTr\TXT/^ 

UAJL^L^lUM-lJJtM'liJNlJjlIN 1 PHUoPJlULJPiU BUNL/UNLr 


1144 


^TPTTO QPTTOTMO CTTOT POT VPTIO CPXJ A TP 
Ulr jlUorxllJlIN Uol 1 KJLt srKJL. i rrlUorxlA 1 Ji 

PHOSPHOHYDROLASE 


1145 


BETEROnUMERIC G PROTEIN 


1146 


T CELL RECEPTOR 


1147 


POLY(U) BINDING 


1148 


ACETYI^COA METABOLISM 


1149 


CYTOSKEDETAL ADAPTOR 


1150 


CAM-DEPENDENT CYCLIC-NUCLEOTIDE 
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PHOSPHODIESTERASE 


1151 


CYCLIN-DEPENDENT PROTEIN KINASE INHIBITOR 


1152 


SERINE BIOSYNTHESIS 


1153 


PHOSPHOGLYCERATE DEHYDROGENASE 


1154 


TRYPSIN 


1155 


MITOTIC S PHASE 


1156 


G-PROTEIN-COUPLED RECEPTOR PHOSPHORYLATING 
PROTEIN KINASE 


1157 


OXIDATIVE PHOSPHORYLATION 


1158 


SMOOTH MUSCLE CONTRACTION REGULATION 


1159 


METHYLMALONYL-COA MUTASE 


1160 


CHEMOSENSATION AND RESPONSE ' 


1161 


CARBAMOYL-PHOSPHATE SYNTHASE f AMMONIA"! 


1162 


NUCLEAR RNA-NUCLEUS EXPORT 


1163 


EMBRYO IMPLANTATION 


1164 


SMALL RIBOSOMAL SUBUNTT 


1165 


ENDOPLASMIC RETICULUM RECEPTOR 


1166 


DNA BENDING 


1167 


INTRACELLULAR 


1168 


PROTEIN SYNTHESIS INITIATION 


1169 


ION TRANSPORT 

XX^X 1 X X^4 XX 1 \mJJL X-^XX X 


1170 


GUANYL-NUCLEOTEDE RFLEASTNG FACTOR 


1171 


SPLICEOSOME ASSEMBLY 


1172 


ADENYLATE CYCLASE 

t, ' m ji ^ X JL*« X X X-l X V_xJw/J xV^JL^ 


1173 


CALCIIJM/CALMODULIN-RFSPONSTVF ADFNYT ATF 
CYCLASE 


1174 


NASCENT POLYPEPTIDE ASSOCIATION 


1175 


NASCENT POLYPEPTIDE- ASSOCIATED COMPLEX 


1176 


NONSENSE-MEDIATED MRNA DECAY 


1177 


ACYL-COA METABOLISM 

* X^^ X X^ X^ < X X^XX^ X X XX^ 1 i 1 1 r XTX 


1178 


DEGRADATION OF CYCLIN 


1179 


CYCLIN SELECTIVE UBIQUmN CONJUGATING ENZYME 


1180 


STEROID BINDING 


1181 


GLYCOSPEONGOLIPID METABOLISM 


1182 


MULTIDRUG TRANSPORTER 


1183 


ORGANIC ANION TRANSPORTER 


1184 


ASIALOGLYCOPROTEIN RECEPTOR 


1185 


NUCLEAR INNER MEMBRANE 


1186 


RIBONUCLEASE INHBrrOR 


1187 


CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE 


1188 


NEUTRAL AMINO ACID "ERANSPORT 


1189 


NEUTRAL AMINO ACID TRANSPORTER 


1190 


TRANSPORT 


1191 


CALPAIN 
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1192 


PROTEIN PHOSPHATASE TYPE 2A CATALYST 


1193 


MEMORY 


1194 


MRNA SPLTCE SITE SFT ECTION 

iVXXVL^/^ kJX X^X\^X^ OXXX^ 1 .1 ji A - 1 ■ V y 1 ^ 


1195 




1196 


DRUG RESISTANCE 


1 107 


JVXi^X-ft3Xi^ 




iVXXV-^X\.v/ X UxJ UX^ iVXVJ X KJjS. 


1199 


NADPH-FERRIHEMOPROTEIN REDUCTASE 


12UU 


MKInA rJilNDJLNCjr, i UIK 


1 oni 


CUAlUMiSK 


1 Of\0 


bUJvAKxUxlC xKAJNoLAixUJN JLiNlllAllUIN r x UK zALrxlA 




JaJDAx KJtiorUINoil 




PTJOTTPTOP CinXTAT T TXin PPriTT7TTsJ QT7PTMTw/TT4T3PnKrTNrP 
KJi^^EJr X Ulv oXOiNAI .1 ilINO Jr JK.\J X JdXlN olilNJLlNIj/ 1 xlKJivJiNXiNIl 


XXUJ 


TsJTTpT pAQp 


190^1 


PMA A/rnnTFTTPATrnN 

I\JL\/\ Vfl^^LJjJrj}^i\xx\JLS 


1907 




1 905? 
X^Uo 


X IxS-V^oXi^Xl — XxSJL^lri X^XVJ/\OX^ 


1909 


TYROSYL-TRNA RIOSYNTHRSIS 

X X XxV^O X X^X JVL^/x XJX\-/kJ X i.^ XXXX.<wXO 


1910 
x^xu 


INTFRT ETTKTN-8 RFPFPTOR LIGAND 


191 1 

XZrX X 


PT TRTMF ^ AT V AGP 


1 919 
xzxz 


n 1 X wA/Vli XxXIXiJu X xxwOx Xlv-'JNJLDvyO X X,^Xx\^AXNOX^XjXvfVOxi 


1 91'^ 

X^X J 


FTFMR OWfrFNA^F H^FrYPT T7T^^^r^ 

rXCaVJJC/ v/jfV 1 vJX-«l 1 r\OXi \LJd\^ 1 v-'XjIZjIJ.N VJ ^ 


1914. 
XZXt" 


xjnj^ia j.vxv>'X-/xrx^/\ X Xv-/i>i 


1 91 S 

X'^X J 


riXO X yJrid iVlKJ,>I A IVXC x /vd^x-.xoj.vx 


1 91/^ 
XZXO 


MAP'K'Tnr PAQPAFiP 
iVlAx JSJSJv o V^rvL/Jj* 


1917 
xzx / 


FPTT PPPFPTHP <2Tn"Nr AT T TNTfr PATRWA Y 


1 91 R 
xzxo 


FHTRPOPT A^T nPOWTH FAPTOP PFPFP^TOP 
jrXxjxx^JjXjAo X vJXVL>' W X xl Prvv-- X \J1\. ISJZX^dx X \JS\. 


1910 


P A9 frTTANTYT ^NTTPT FOTTDF FYPTTANfrF FAPTOP 


1990 


PTTO^PFTOnT YPFP ATF MTTTA^F 

X xxV/Ox XXVyVJXj X V^Xj>XVr\ X XJr iVXvJ X /^ox^ 


1991 


GT TlTATTrrONF TR AN*5FFRASF 

\jX-/w X /\ X xxiv^x~i-( X Jajtvl^ox i-*rvr\oxi 


1999 


TTFAVYMFTAT RF^TSTANPF 


1223 


HEAVY METAL RESPONSE 

XXXJXT V X XYXX^ X xV.L^ XXXJfCX \JX^\JX^ 


1224 


HEAVY METAL TON TRANSPORT 

XII ifTL Y X XVXX^X riLXj XV^X^ X XVTxX 1 kJX VyXX X 


1225 


COPPER ZINC SUPEROXIDE DISMUTASE 

v^x^x X xjtXxf t t.i I ^ v^ L^x x^xvx^/yi 1 1 ^x^ x^Xhjxvxw x i^tjx^ 


1226 


CYTOPLASMIC VESICLE 

W X X \^X Xmm,%JXtXX^^ T JL^^XV^XjJL^ 


1227 


CELL ADHESION INHIBITION 


1228 


RHO GDP-DISSOCIATION INffiBlTOR 


1229 


CELL FATE SPECIFICATION 


1230 


CILIUM 


1231 


MORPHOGENESIS 


1232 


PHOSPHATIDYLINOSrrOL TRANSPORTER 


1233 


COCHAPERONE 
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12J4 


oncT r'TiAPTrpnxrrwTi tttrttt TM FOT DTNG PATHWAY 




D"D r^TTn TM A CP TNTPTTRTTnT? 
rKU 1 liJIN AoJti JUNxlLDll \JX\ 


Izio 


LTD A \ TV XyirrT AT THM TP ATsJQPm?TPP 


1 Tin 
1237 


LAC 1 Abxz 


1 0 

IZoo 


rriUor rlUrK U d U JSJiN /Von 


1239 




124U 


bnr i Alii J UiNCllUJN 


1241 


r^T7T T /^T7T T A TMITCPTTKTC TTTMr'*TTO^ 

CxilJ-.-CJiLL AL)iltlKJtiiNo J UXNU i. vJri 


1242 


TXl'l'L'Pr^T T TTT AP TTTNJOTTn'M AQQP"\/rRT Y 


1243 


LAC 1 U YLCjLU 1 a 1 xllUJN Ji L i AoJd 


1244 


HYDROXYMblHYLoILAINii o iJNlxlAoli 


1245 


DUTP P iRUPHOorriAl Ac>ii 


1246 


PROTEIN FMUbrliAl Aoli 1 irli ZC 


1247 


f-K TTTTD TTDTJ i^XT A T "DXJ A /DTTT* A P"Pn7PTT^P 

[NT. iiKriiKUJN - ALirllA/i5±l 1 A KcCiir l UK 


1 O >l o 

1248 


GL I CUN Jtl Mb 1 AdUULo JVL 


1249 


DlUi ACID JNlcl AdUIJLoJVI 


1250 


AT>"Vrr CT TT Tj A A CD 

AR I to UJLr A 1 AdiI 


1251 


LYoUoUMAL IKAJNorUKl 


1 oco 

1252 


TjrvT^"Dr^r^T7Kr tt^ axtct nr* A TTNin V TYPT? ATPA^P 
rlYJJKUCrilJN-lKAiNoJ-*UL^AlJuNU V-1 irSZ r\irr\Olj4 


1253 


TTr7X /rc TO A XTCO/^P TCP 


1 Off yl 

1254 


or "VOi^/^TTXT OtJOCPTJOP VT ACT? 


i o £r 

1255 


CREATINE TRANoPUKibK 


1256 


NEuRUlKANoMll liiK UrlAJSJi 


1257 


OT>T3 A TTVTD.C/^TMTTNyr CVTVAPOPTtTP 

CREA 1 iINE:oUi->lU M o I JVLrUK 1 rUv 


1258 


EICOSANOID Mbl AdUJUIoM 


1259 


A r /"ITT TK K T\T7T>Tn^Tr\TnvTT /^VTT^Crf^T T/^ PUr^CPWOT TP A QT7 A 1 

CALCIUM-DEPEiNDEN ICY lOoUliC rrlUorxlUJ-rir Aoii AZ 


1260 


ENDORIBON UCLEAoE 


1261 


a T T\T7TTVT\T? /^"VTPV a CD 

ALDEHYDE UXlDAoE 


1262 


XANImNE DEHYDROCjEN Abb 


1263 


/-\'vrv/*^T7XT A XTr\ Tl AT^T/^AT TV ITDT A "D f^T TC\A i* 

OXYGEN AND RADICAL Mb 1 Ax> ULJoM ' 


1264 


BLOOD GROUP AN llvjbN 


1 Oil c 

1265 


r\wn"Dxr td axtcpoptdp 
OXYGbN IKANorOKlJciK 


120D 


r^wn'DKr tp aktqppiPT 


1267 


XI KI'DTT A CI? 

NllKlLAob 


1/00 


O'DKITM 
IvnJNiLN 


1269 


T^KT A "n A A/T A CVQ TlSTnT THTTn PP nTFTTM PTTO ^PHOP YT A Tl ON 


1270 


bLAo 1 Oi-'JCrKM obvJiVliitN 1 A 1 WJvi 


1271 


MEMBRANE PROTEIN ECTODOMAIN PROTEOLYSIS 


1272 


SPINDLE MICROTUBULE 


1273 


ANTEROGRADE AXON CARGO TRANSPORT 


1274 


ORGANELLE ORGANIZATION AND BIOGENESIS 


1275 


GLYCIPAN 


1276 


EXO-ALPHA-SIALIDASE 


1277 


MANNOSYLTRANSFERASE 
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1278 




1279 




1280 


frPT- A NPWnP TP A NTQ A "MTH A 


1281 




1289 


MTKPT P POKfTP APTTmsJ PPHTTT ATrnM 


198'^ 

XAtOD 


^YNAPTTP VP'^TPT P 

O X IN/vT X XK^ V IZrOX^X-tD 


19R4 


MPTTPHTP AMQA/fTTTPP T?TTrPACP 


198^ 






m TTTAPPr>nYTxr 


19R7 


APTn PWnCPTJATA CD 


19SK 




19RO 


5TTRCTP ATP PPlTTMT^ PPT T TV/imO ATTOXT PUT T PVTTTMQT/^XT 


19Qn 


TMTP A PPT T TTT AP PP^PPPD TvPT T\7PT3V 
XiN 1 JtvAV^cJULU JLAK V^Ur JrJtiK JL>ni «i VxiK I 


1901 


PATPTPPCTM 


1909 


r PT TK'nTP TPNTP A/TPfTAPPlT TQXyf 
X^U J\.V^ X JKJiiiNJD iVXD X /wVJJLiXojVX 




QPPPA/TrnTXTP CVXTTUACP 




PHT VAAyTTMP A4PTAPOT TQAA 
xKJxu I AlYLUMl IvXXi X AxsCX^XoiVX 


190^ 


MPTTTTnTsJTMP A/TPTAPPiT TQA^T 
iVXXl X rXX^^iN XIN H IVXD X /\J5 vyXJ.o IVX 




PPPiTPrW PHPQPWATACP TMTTTRTTnP 


1907 


HP ATW PPPPPTHP T TnATsm 


190R 


A PPiPrnTTP X/fTTPPprnNm"D t a t pit a Kmp c 

I\x\Jr X U X XL- JVU 1 L^L^JnlUiN JJKlAJ-i CxlAIN VJiio 


1900 


FMTiT TPTTPlXr HP A POPTPl CTC \n A T^P A TTT PiOAA A TKT X> PPPDT'OD C 
UNXJUl^XXUXN Ur AJrUrlUolo VIA JUxjAIxl IJUMAJUN KxlCxlr 1 UKo 




\APfcT PPT TT A P TTT TMPTTPHvT 
mSJlJZA^ UX-iAJK._Jr UIN U X XUJN 


i^ni 


r»T VPPPrvPwnQPTTOT tptt^ x/rPTAPpkT Tc\>r 




1 PTIPlQPPTATmVT TMPCTTr^T A DTJOCPTTATTP. XTTXTA CX7 




XA" A T A TR nPTTVT^P OPPXT A CP 


1 'XCiA 


PAT PPT\>f r%PDPKTT^P\rT PT7T T A "rkTrDCTrf^XT 


1 '^fK 


TUTPiPT TP TMP C A/rPTTTVT TD A TVTCTTCP A CD 




PXJPk CAA ATT AAPTVlPAyTDPTrf^ mPP A CD 




AFlP PTTT^TTPTTOKT 


JL JUO 


MTTPT PnTTTiP AvrFTTAPnT TQlV/1 
iN U V-X-iD^J X XL^I^ IVXD X A-D vJX^oIVX 


1^00 


PfRONTTPT PPQTnP ■nTPTTPkCPTJATR P ATA POT TCAVT 

iNXDvj'iN u\^iJiUoxx-/xi x-'XriuUorxlAlJci UAx AJ5UX-J.0JVX 




PfRO'NrTTPT PA^P 


XDXX 


FATTY APTT) PTNTnTNTPr 


1312 


MOLECULAR JFUNCnON UNKNOWN 




OPP"\>r PPT T T^PVPT nPTWrPTXTT 
OXjJxIVX K^iJti t\ t xJxlt V CtLlJrlVlr'JN X 




1?PPPP Q QTOlSJ HP CT TP \rf\T A T nPXTP PP f\TS\ TPTC 
SSJ24xS\xliOOx\JxS yjr 0 Uiv V X V AL» LriiiNii rssSJxjyjK^ X 0 


1315 


t)TADENOSTNE POLYPHO^PTT ATP P ATAPOT T^M 


1316 


EXONUCLEASE 


1317 


UV PROTECTION 


1318 


DOUBLE-STRANDED DNA SPECIFIC 
EXODEOXYRBONUCLEASE 


1319 


EOF RECEPTOR MODULATION 


1320 


AXONEMAL MOTOR 
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AYONFMAT DYNFTN 


1099 


PPOTFTN PTTOSPHATA^F 

JrXVW X JdXl^ X XIWOX XxTil -TVOXj 




TNOSTTOI7PHOSPHATTDYLINOS1TOL PHOSPHATASE 

Xl^V^kJXX K-^XJ X XX\^\JX XXr\X XXy X X^JLLl \^kJXX \^X^ X XJ^\y\JX ^JU^X^x^^J^ 




OSSTPrrATTON 




GT T JPO^AMTNE PATAROLTSM 




GT urns AMTKF-6-PHO*5PH ATE ISOMERASE 

vj X-/ \^\J \~y xxiyxxi. > x./ \j x xx\jijx xxr\ x jo jio v-/j.Ti_L_»j.vriij j_j 




MTTOTTr VTRTAPTTA^FVANAPHASE TRANSITION 

iVXXXv/XXV^ IVXXiX /vrXX/\OXj//lLL^fTLX XXiTiOX-f XXVrVl^OXl XV-Zli 




Od.l-tI2tXS}\Jiyl Dxl\xJxl^\J 




G/T.MT^lMATrTT-^PPrTPTr TTTYMTNF-DNA GT YPOSYT ASF 
vjr/ X "iVXXOiyxrt. X ^xx ox Xi^xrx^ i xx x xvxxiNXi x/i.^ri. vjx^ x x Xj^tvwx^ 




lS KJxS -Oxll-div-*. 1 X V JC» V x20Xl^l^JD xJ *^v^ JMLLN VJ 


1 '5'^! 


TMTT? A GHT TP AlSJ<2PnPT 






xDjD 


P A ^ AT T MOlSjn'MTTPTP GTP A 9F 




xxxVJoJrJriVJlVlDV AJ-»V^iNA J^JLLNr\oxl 


XJDD 


PPHTFA^nA^T? APTTVATDP 


1 '^'^^ 


TWVMTnVT ATR TTTNA W 
i XX I iVxJLL/ X JLir\ iH JSJUN AoC 


1^3 / 


PPn<2TAGT ANnTNMTTTAPiOT T^M 

xiSXJO 1 AVTJ-iTMNX^XiN IVxC 1 /Vx>V/X-flOiVX 


i JJO 


^TNTGT F-^^TP ANTv PPFATC PT^PATR 


xjjy 


PTTO^PFfATTDYT jPTTOT TNF TP ANSPORTFR 




NFTTTTPnTR AN^MTTTFP 9YNTTTF^T9 AND STOP AGF 


1 Jtl 


TP AN^ AT DOT A^F 


1'^A9 


^VNAT>TnNFMAT POMPT FY 




nTWVTiP OT TPO AMTDF FiFRVDR OGFN A SF 




PATAPGT TP PAPPOTTVnP ATF MPTTAT^OT T^M 




nPATTTPPPPPrnP AQQnPTATFT^FAPrnP 


XD^O 


nrvTipniTPM tp atsj^poptpp 




PHTTNJVT AT AMTXTP TWTPTAPnT T^IV/T 


1348 


TETRAHYDROBIOPTERIN BIOSYNTHESIS 




A A XJVTiP r\ Y VTITTD ATTVIPlPOTiTnPTTTPTNJ "nTTTTVTiP ATA 
4 A-xx X iJKLl A X 1 XI i KAXi x ]Ji\\Jx>xKJr X XiKXlN JJXixl 1 JJKA 1 AoXl 




A T A r^TOXTTlvT A C"C? 

UAJLAL i UJsJLlN Aoii 


1 Ki 


vjALAU X \JoCi iViC X AlSUXJLolVX 




PTQr^* 'KnrTPT phqvt ^ tutp apwh^pwata w ^'^vmmptptp at ^ 

I^lo^D -IN UV^XJiVJo I Xjr^- XXIX IxAx jnL^OX:^xl/\ X AoX3 I IVXlVXIiX XsJ.V^/\X-r^ 




inKlTP TTsTQTTT ATTmsI PlP MPTTPmsTQ T^V GT TAT PFT T ^ 
XUINXV.^ XINoUXjAXXVJIN yjSr lNIiUl\.L/INO 15 X VjX-«LriJ-* k^Xlljl^ 




Tvpp 1 wPT^^P/TTTPF^^sJTN^P cippptfiTP ppotftnt 

X IlTxl X oxilNJINrV lijLlSJLjpVrlNXiNxl OXrllV^XirXv^ JT IV^^ X X-»iIN 

PHOSPHATASE INHIBITOR 

XXXV/OX XXTx X X^iJX^ XX^XXXXfXX \^XK. 




BTOLOGTCAL PROCESS UNKNOWN 


X*7«/l./ 


C ATHEPSIN H 


1357 


CASPASE-ACnVATED DEOXYRIBONUCLEASE 


1358 


ACYLPHOSPHATASE 


1359 


ACYL-COABINDDNfG 


1360 


PROLYL OUGOPEPTTDASE 


1361 


GROWTH FACTOR 


1362 


PHOSPHATEDYUNOSrrOL-BISPHOSPHATASE 


1363 


ION CHANNEL INHIBITOR 
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1364 


DIHYDROBIOPTERTN REDUCTION 

JL-'JJL Jl X X^XW-/UX\-fX X X^XvJLL ^ AVI—dU' \-f X JIWX 1 


1365 


DIHYDROPTERTDTNE REDUCTASE 


1366 


DIAZEPAM-BINDING INHIBITOR 

iL-^ XJr\J mLjX xaXVX XJXX^X.'XXiVJ IIt^ Ii'*' ' * ■"^ 


1367 


GALACTOSE BINDING LECTIN 


1368 


ORNITHTNF MFTABOI ISM 

wXVl 1 X X X XXi ^X-« IVXJL/ X Jl\Xj WJ_aX01VX 


1369 


ORNlTHTMF--OyO-ArTD AMINOTRANSFERASE 


1370 


PATHFPSTN B 


XD 1 X 


RTT T? Arrn tt? an^phrtftp 


xD 1 


1 JT O X IN 1 TxtxoTj 


XD ID 


^nRRTTOT MTTTATinT KM 


xo /'*■ 


TTRTnTTTTTN T TTTP APTTVATTNTrr FN7YMF 

UxjIv^ Ul 1 ilN-l^iiSJCi £\\^ 1 1 V A 1 IIN Vj X-»Jl T JL» i 1VJLC# 




nTTTYnPOT TPnAMTnT? ^-APFTYTTR ANSFFR ASF 
Uxxx I l^xvVJl^Ix L/ rVlVJ WJi!* O-AV-^-Cl 1 l-»lx\./\iNoriixVrt.Oljf 


10 /u 


PHOSPHOR YT ASP 


T^77 
xO 1 1 


OT TTTAMATF PATABOT ISM 


10 / o 


PPTTPTOSF 9 ^^-RTSPTTOSPTTATF MFTABOI ISM 

riVU^l \^i3jJr ^,U~X>iOJrxxV-/Ox xXr\XX-t iVXXiX/VDV^J-iXiJiVX 


10 ly 


FRTTPTOSF-9 fi-BTSPTTOSPTTATF 9-PHOSPFrATASF 
riv uv--^ X ^^ox-f"ir,i>"i>iox xxv-zoxr xx^^i xis^ox^xiri x /^oxji 


xDO\J 


^-BFTA-TTYDROYY-DFT TA^^VSTFROTD DFHYDROGFNASF 

D XjCj X xTl XJL X XJSS\Jx\. X -L/.t.li,l^ X /^V.-' ) ^ X X-<XVWXJ->' X-/X-»A1 X X-/XWy\JX-<X^rtiJX-< 


XDOX 


AT PTTA DMA POT YMFR ASF'PRTMASF COMPI FX 

r\JLJr XXXx JL/X~X^ X ^X^X IVXXJJLVriLOXJfiX XVXXVXi^iJX^ ^VyiVXX JUULt^V 


lOOZr 


APONITATP TTYDR ATASF 


lOOO 


MTTOmONnPTAI T ARGF RIBOSOMAT^SIIBTTNTT 


I-JOt" 


MRNA FTiTTTNG 

JlYXxvXN/\ Xji«/XXXXlVJ 


lOOO 


rvnospoRTN a bindtng 

V-*- X \w>X-A^Ox WX^JX^ t\. XJXi>iX-'XX> VJ 


lOOU 


MFV AT ON ATP TR ANSPORT 


lOO / 


MFV AT ONATF TR ANSPORTFR 


lOoo 


MOMOPAPROVYT TP A PIT) TR ANSPORT 


10O7 


M01SJ0PARB0YYT TP A PTH TR ANSPORTFR 


lOi^u 


r"JZJUx2fIVlx21>l 1 ljli>IJL/lINVJ 


lOi^i 


AMTNOAPYT ASF 


loyz 


A^PARTATF TRNATTGASF 


107O 


ASPARTYT^TPNA BTOSYNTTTFSTS 
riOlr/\xvl Il-f-llvLNrV X>1^0 XINIXxJCajIO 


loy^f 


POT YPFPTTDF N-APFTTYT GAT APTOS AMTNYI TR ANST7FR ASF 

r V-'X-i X X Xix 1 IXJJCi rH~rW^x2i 1 X XAJ/1ULrf/\\-* X V-/OiTXVXXlN X X^X X\JTd>OJ/X-AtVr\LjXi 


lO^*/ 


POT T AGFN 


1396 


HYDROGFN/POTASSTTIM-EXCHANGTNG ATPASE 


1397 


SARCOGLYCAN COMPLEX 


1398 


FUCOSYLTRANSFERASE 

X w\Mfvy^ X x^ X xvrxi 1 vjx .i jj vjrvtjx-* 


1399 


AMINOPEPTIDASE 

iJULVXXX T V-'X XJX X XJ^XlurjL.i 


1400 


UDP-GLUCOSE 4-EPIMERASE 

KJX^x vjx^iw' ^vyux-/ ~ x^x xxYxxjfXXjn.u j-« 


1401 


TOANSPORTIN 


1402 


METHIONYL AMINOPEPTIDASE 


1403 


DPEPTIDYL-PEPTIDASE 


1404 


PITRILYSIN 


1405 


b',3'-CYaJC NUCLEOUDE 3'-PH0SPH0DIESTERASE 


1406 


toCLBAR OUTER MEMBRANE 


1407 


IMP DEHYDROGENASE 
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14Uo 


'^xLr 1 JLUJtl JVill 1 AJoVJi-rloIVI 


1 /I no < 


^xIUxCD 1 V^lv I V Hol^i-iD IVJLTMVJUDIvrSJ^ 


141U( 


3L» X UUNli— 1 KIN A 1J.Vj/\oJD 


1411 


LI L" 1 'L.'p r^nrp T\>nTP ir* n PunTPTM HTPA^F AT PTTA STTBUNri" 
HKi tRU 1 ixJLlVLcKiC vjr-i xv\Jli2.iiN rU-Jrxiri. ouajwi^xx 


1412j 


A T>'n r^HTD A CD A r^TTA/" A TOP 


t At ^ 

1413 


I'EKUXloUMAL MAIKIA 


1 y| 1 /I " 

1414 


vrtj'D V T oxin n^'ATxr "RATTV Arrn "N/TRTA'RnT T5sM 


1415 


A T TITT A /^T TT/^/^CTr\ A CP 

AXJ^HA-OLUCUoiUAoxi 


1416 


;'HAGOCYTOSli> 


1417 


CREATINb 131Uo i JN incolo 


1418 


GLYCINE AMIDiNOlKANorJbKAoii 


1419 


MAP KINASE 


1420 


CALrLlUM-ACllVAlxiL' rUiAoMUivx i^iimNiNXiJ-* 


1421 


dLEOMYCUN MYDKUJL/Aoii 


1422 


A X iiTVTr\T5T 7T^VT> A TTJ O A T A "DOT TQlVA" 

AMlNUJiU 1 iKAlii l^Al Al5UJL#xoivi 


1423 


o UCCJJN A 1 E-oJiMlAJLUiixl x Uxi UJtlrl i JJJ\u^jrx:»iN/\oxi 


1424 


jYoUoUiVLALr JVUilVlDKAiNil 


1425 


'^xiAUUoUJyLc rUKMAllUlN 


1426 


JEliKAJNCxllJNvj xiiNZ#x iVUi 


1427 


T TT' A \ 4" A TTJ TlTJTJVT^D On'RM A ^"R 

kjUJ 1 AMA 1 H JJiixl i JJKULriiiNAoJCf 


1428 


SPHINGUiJLrlD JVUil Ai5UiJ.oM 


1429 


SERINE C-P ALMi i U i JL i KAiN brxiKAoii 


1430 


Dxrri/^orwyrc TO T VCOQOTV>rc TR A MQPOPT 


1431 


CFT J . GROW T 11 AND/UK MAUN iiilN AJNV^Ji 


1432 


VT A T^rrwf . \ Tno ATvTCTTvrriPO*n'nKrACT7 (XI QP"pr^l"Wir"*"\ 

nAD(P)(+) TRANoH X UKUurJNAbii ^r)-orxiv^iri*^; 


1433 


\g K fr\Tt TTTOT^rM^O'VirD A TTPTT TTV PTTPTTnTT TP AtJ^POPTPP 

VIAJ OR HISTOCOMPA 1 IbULx 1 x r xir 1 Jl^xi i katn or x xiis. 


1434 


AMINOGL X LAN JiiUo x IS 1 nxiolo 


1435 


ACETYLGLULOiSAMIIN iL,lKAJNorJcKAoii 


1436 


UBIQUINOL-CYTOLHRUMji-L KiiuULlAoii 


1437 


A T-\T?TvTvrT A TTI? /TVOT A CT3 TMXTTD TTTOTvT 

ADEN YLA 1 E C Y CJLAoli UN JnULbl 1 lUiN 


1438 


GLUCONEOGENESIS 


1439 


T>kTm A T> "r» /~\"vr'\7T T/^ A /^Tr\ TL> A XTCPOPT 

DICARBOXYLIC ACID IKANbrUKl 


1440 


i-\T*^ Ann rf^W/T T/^ A /^TPk TD A XTCDOP TI7P 

DICARBOXYLIC ACID IKAJVorUKlJtlK 


^ A A-^ 

1441 


/~t A n"r»TT7n 

CARRIER 


1 A A^ 

1442 


rNON-oELbCllVJi VJtiolLlJl rUoiUiN 


t A A^ 

1442 


\ >rrrooilOXrr\P TAT OTTP A TT7 TP A M^POP T 

Mil OCHONDRIAL* CI 1 KA i Jtt 1 KAIN orUK x 


1441 


c/^TMTTXA'.T^TO APPOWT ATT7/TPTP AP"R0WT AM''F 

SODlUM:DxC AKJtsU A 1 1-*A 1 Jd/ 1 KlUAKDUA X J-«/v X xi 
COTRANSPORTER 


144f 


SUBTILISIN 


144e 


i PROPROTEIN CONVERTASE 2 


144' 


' SERINE-TRNA UGASE 


144S 


METALLOEXOPEFIIDASE 


144S 


HOLOCYTOCHROME C SYNTHASE 


145( 


AXON GUIDANCE 
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1 A^l 


PP nTP/TM VTPMTIP A KTP TAP fTpTTWri 
Jr JvvJ I HJiN -xVlXilVJLD Ix/vlN Ju» JL r\J\\JX2i 1 iiN 


1 






PnTV/TPT "PTV/rPTSJT APTTVATTHM 




nYOni TTTAPATTTT^PTTVTlPnrrPXfAW H TP^A■MT^P^ 




PnTAC^TTTlWf'PWT HPTHP QVMPnPTRP 




HT VPTTsJP PATA"RriT KJM 
vJJLt I l^liNll V^/\ X AXSV^X-XoIVX 


1 

IH-J / 






OT TnnC ArT'UAPTnTT \4THTAPnT TCA>f 
Ui^lLrUoA^i^rlAKiUli JVliil AIjvJI^IoM 




MAxNINUo I J^-Ul-iLrUoAL^UllAKlUXi l,Z-AlJrJ±ft.-iVXAlNiNlJolJuAoXl 


1 AAA 


XN-tTl^Il^AJN rXUV^lioolINLj 


140 i 


TTTP i^T TT/^OCT7 1 DXJACDXT ATT? TTDTPiVT VT TX> AXrCU'U'L^ A C17 

U 1 j^--ijLrU C^Uoii- 1 -r rlUor HA 1 li UKIU i L» i L, 1 KAIN o riiJcCAoli 


14oZ 


C\ \A L/'I'LIVT TTD A XTCT7CD A C"D 

U-Mrl 1 ii y L 1 K AJN orhKAoii 


1 yl^'i 

14o:> 


C /^T T TTT7 . A T^r^XT C VTV 4T>PlP "TCP 

oULUlli:UAllUiN oiMrUKlliK 


1404 


1 1 Uv^ilKUJVLti 


140D 


OYOr^T T TT A P A TT7 A/T A T ATT? A'NJTTPnPTTTP 
L/AUOL«U 1 AKA X cJNU\LiI\ X Jci AiN X XriJJK. X JDjx 


1400 




1 Al^7 
140 / 


X iJl^XlKUlVLli 15 


AH-OO 


PVPTTV/' ATP TV/fPTAPHT I^IkA 


1 A^Q 


TV/TAT ATP riPWVTiPnnPMAQP /TiPPAPPHYVT ATTMn^ 
iVXAXjAXH X-'UXl I X-'XS.VJOijiN AoH |^X-/Xl\^AJvDUA I XjAX ilN 


1 A7n 
14 /U 


PPATPTM VTMA QP r* 
rKUXXiliN JSJJNAoIi C 


1 A71 
14/1 


PAQPACP AOTTA/ ATTHKr \7TA PVTOPTTPn'MP P 
CAor Aoli AU 1 1 V A X lUIN V lA V-^ I 1 UC-XlKUJVlIl \^ 


1 AT> 
14 /Z 


TTT^P ni TTr*riCp-nT vonppnTRTM nj Tir^nQVT tp antqppp a cp 


1 AT^ 

14/:) 


A rmV/ A TinxT op Ayr a p/pp v tttm a cp tttnt a qp 

AV^llVAllUiN Ur MAr/XlJKJKw JSJiN Aoll JsJUNAo 


1 A7A 
14/4 


r^ATAT ACP 
UAlAJjAoJti 


1475 


INSOLUBLE FRACTION 


1 Ann 

1476 


MITOCHONDRIAL TRANSPORT 


1477 


GOLGI LUMEN 


1478 


ENDOCYTOTIC TRANSPORT VESICLE 


1479 


SODIUM DEPENDENT MULTT/rTAMIN TRANSPORTER 


1480 


POLY-GLUTAMINE TRACT BINDING 


14ol 


IkANSMEMBRANE receptor protein SERINE/THREONINE 
KUNAoli 


1 AR9 


lKAXNoivltlJVU>KALrNiliKxlCl^ rKUlxlilN ollKJJNiv lllKJiUiNliNl^ 
TTTMA W CinNFAT T TMH PATPTWAV 




nPTTfi TP AXTQPnPTPP 




OTPrP OTn "HPT T A T^OMPP A W 
O LSjJSxJXXJ X-'XIXjX A-XoV-^iVXJUiJxAOX-lf 




PWn^PHnPYT AW K'TMA^P PPfrTTT ATOP 


1486 


FERREDOXIN-NADPf+1 REDUCTASE 


1487 


PROFEJN BINDING 


1488 


AlP-DEPENDENT PEPTIDASE 


1489 


EPOXIDE HYDROLASE 


1490 


LEUK0TRIENE-A4 HYDROLASE 


1491 


MANNOSE METABOUSM 


1492 


MANNOSE-6-PHOSPHATE KOMERASE 
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1493 


OTJT T W7 ATT 

LELL WALL 


1494 


N-ACETYLGLuCOSAMlJNJhi Mhl ArSULloM 


1495 


N-ACETYLMATsNUSAMiNb Jyuil AiiULioM 


1496 


CENTROMERE/JONJElUCHOKb LUMrLUA MAI UKAliUJN 


1497 


3-BriDROXYlSUBUT iR I L-UUA HiDKULAoJb 


1498 


PYRUVATE DEHYDRUCjxiN Aob (LIrUAMiUxi; 


1499 


/-^j^T\T\ /^T\/^T>T\TTV7Tr>TXT/^/^T?XT /^"VTTN A O'C 

COPROPORPHYRINOGEN OXIDASE 


1500 


EXOPEPTIDASE 


1501 


GAMMA-GLUT AMYL HYDROLASE 


1502 


T>*.rf^TTT^T "r-* o^ni > A XTTNT~?TN T^XT A A TXT^XT^^OTXTD TN'O A X iTTXT A OX? 

DOUBLE-STRANDED RNA ADENOSINE DEAMIN ASJi 


1503 


ETEL\NOL OXIDATION 


1504 


ETHANOL METABOLISM 


1505 


ALCOHOL DEHYDROGENASE 


1506 


ALCOHOL DEHYDROGENASE, ZINC-DEPENDENT 


1507 


ANTIMICROBIAL RESPONSE PROTEIN 


1508 


BRANCHED-CHAIN AMINO ACID AMLNOTRAN5>riiRAbE 


1509 


TNTx A xT^^-rr Tf T'-v /^TT A TXT A X /TTT XT' A X JTTXT^ A r^TP\ TiTl^CWV 1 'LTDCTC 

BRANCHED CHAIN FAMILY AMINO ACID BlUb i JN lilbi>lb 


1 ^in 

J. JlU 




1511 


3-OXOACID COA-TRANSFERASE 


1512 


PHOSPHATIDYLINOSITOL BIOSYNTHESIS 


1513 


PROTEIN PHOSPHATASE TYPE 1 REGULATOR 


1514 


SERINE METABOUSM 


1515 


PHOSPHOSERINE PHOSPHATASE 


1516 


MUTAGENESIS 
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A resulting entry for a peptide sequence based on these criteria will have the 
following format: SEQ ID NO of peptide entry ] Numeric code corresponding to cell 
type and HLA type | SEQ ID NOs of source protein reference(s) | source protein 
symbol(s) | Numeric keys corresponding to biological classification(s). This ordering 
5 corresponds to Criteria 1 [ Criteria 2 ] Criteria 3 ] Criteria 4 1 Criteria 5. 

Exam ple 2: Isolation and Characterization of Global Peptide Tags (GPD 

This example describes the use of enzymatic or chenucal digestion strategies to 

reduce proteins of a complex mixture into peptides. These peptides are called Global 
10 Peptide Tags (GPT). The peptides were separated and fractionated by multiple modes of 

chromatography and ultimately sequenced by liquid chromatography on-line with tandem 

mass spectrometry (LC/MS/MS). 

Prior to digestion, all proteins of a sample were denatured using high 

concentrations of chemical denaturants (such as 6-8 M Urea or 6-8 M guanidine 
15 hydrochloride), elevated temperature, or a combination of both chemical denaturants and 

elevated temperature. Additionally, reactive thiol groups were typically reduced by the 

action of dithiothreitol (DTT) or Tris[2-carboxyethylphosphine] hydrochloride (TCEP) at 

a molar concentration of 25 to 50 times greater than that of the total protein 

concentration, and alkylated with an alkylating agent (at a molar concentration of 25 to 
20 50 times greater than that of the total protein concentration) such as iodoacetamide or 

iodoacetic acid. The reaction was typically carried out at room temperature and in the 

dark. 

Two rounds of reduction and alkylation were usually used for each protein 
mixture to ensure complete reduction and alkylation of reactive thiol groups. Solutions 

25 were typically made in a 50-100 mM ammonium bicarbonate solution at an approximate 
pH of 8.2. Subsequently, the reduced and alkylated protein mixture was concentrated to 
a volume of approximately 50-100 uL under vacuum in a Speedvac™ centrifugal 
concentrator (ThermoSavant Scientific) used at ambient temperature. This process also 
removes the majority of the excess DTT that is used to quench the final alkylation 

30 reaction. Followdng concentration, the resultant solution was re-diluted with the 

ammonium bicarbonate solution and an enzyme was added at a weight to weight ratio of 
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1 part enzyme to 25-40 parts protein. Those protein mixtures that were chemically 
denatured were diluted with the ammonium bicarbonate solution to reduce the 
concentration of the chemical denaturant to less than 1 M. The enzyme was also 
dissolved in the ammonium bicarbonate solution. Enzymes that have been used to 

5 generate peptides from a complex protein mixture include trypsin and lysine 

endopeptidase. All enzymatic digestions were carried out overnight (typically 18 to 26 
hours) at a temperature of After protein digestion, enzymes were deactivated using 
10% acetic acid solution, and peptides were separated from undigested protein and 
isolated by ultra filtration using either a 3 kDa or 5 kDa spin filtration device. 

10 Alternatively, total protein digest are generated chemically using cyanogen bromide. 
Again, peptides were isolated from undigested proteins by ultra filtration. 

Peptide-rich solutions were separated by multiple modes of chromatography. The 
first mode was usually strong cation exchange (SCX) using a stationary phase such as 
polyethylaspartanoide (from PolyLC Inc.) and an aqueous mobile phase that was modified 

15 with acetonitrile (5-15% v/v) and developed a salt gradient from 0 to 1 M salt to elute the 
adsorbed peptides. Each of the peptide-rich fractions that was isolated by SCX 
chromatography was further separated and fractionated using a C18 reversed phase 
microbore (1 mm id) column and mobile phases that were modified with trifluoroacetic 
acid and developed an acetonitrile gradient. Peptide-rich fractions that were isolated by 

20 reversed phase chromatography were subjected to on-line LC/MS/MS using a further 
dimension of reversed phase chromatography. Peptide sequence elucidation was by 
database searching raw MS/MS spectra against publicly available protein sequence 
databases. 

25 Table 1 contains the sequences of a series of EPTs and GPTs identified as 

described in Examples 1 and 2. The conventions detailed in Example 1 are used to 
describe EPTs and GPTs, with the exception of "HLA source" which does not apply to 
GPTs. 
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Other Embodiments 
While the invention has been described in conjunction with the detailed 
description thereof, the foregoing description is intended to illustrate and not limit the 
scope of the invention, which is defined by the scope of the appended claims. Other 
5 aspects, advantages, and modifications are within the scope of the following claims. 
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What is claimed is: 

1. An isolated nucleic acid comprising a nucleotide sequence that encodes a 
polypeptide comprising an amino acid sequence which is at least 95% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235. 

5 2. An isolated nucleic acid comprising a nucleotide sequence that encodes a 

polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NOs: 1-235. 

3. The nucleic acid of claim 2, wherein the peptide sequence is identical to that of 
10 a naturally processed class I or class n MHC-binding peptide. 

4. An isolated nucleic acid comprising a nucleotide sequence that encodes a 
polypeptide comprising an amino acid sequence selected from the group consisting of a 
variant of any one of SEQ ID NOs: 1-235, wherein the variant has no more than two 

15 conservative amino add substitutions. 

5. An isolated nucleic acid comprising a nucleotide sequence that encodes a 
polypeptide comprising at least 8 contiguous amino acids of an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1-235. 

20 

6. An isolated nucleic acid comprising a nucleotide sequence encoding a 
polypeptide consisting of an amino acid sequence selected from the group consisting of 
SEQ ID NOs: 1-235. 

25 7. An isolated nucleic acid comprising a nucleotide sequence encoding a 

polypeptide consisting essentially of an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 1-235. 

8. An isolated nucleic acid comprising a nucleotide sequence encoding a 
30 polypeptide comprising no more than 30 contiguous amino acids of a naturally occurring 
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human protein, wherein the naturally occurring protein comprises an amino acid 
sequence selected from the group consisting of SEQ TD NOs: 1-235. 

9. The nucleic acid of claim 8, wherein the peptide sequence is identical to that of 
5 a naturally processed class I or class n MHC-binding peptide, 

10. A purified polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOs: 1-235. 

10 11. The polypeptide of claim 10, wherein the peptide sequence is identical to that 

of a naturally processed class I or class n MHC-binding peptide. 

12. A purified polypeptide comprising at least 8 contiguous amino acids of an 
amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235. 

15 

13. A purified polypeptide, comprising at least an immunogenic portion of a 
protein, wherein the protein comprises an amino acid selected from the group consisting 
of SEQ ID NOs: 1-235. 

20 14. A purified inconunogenic polypeptide comprising at least 8 contiguous amino 

acids of an amino acid sequence selected fix)m the group consisting of SEQ ID NOS: 1- 
235. 

15. A purified polypeptide comprising an amino acid sequence selected from the 
25 group consisting of SEQ ID NOs: 1-235, wherein the purified polypeptide comprises at 

least 25 amino acids. 

16. The purified polypeptide of claim 14 wherein the polypeptide comprises 
fewer than 100 amino acids. 

30 
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17. The purified polypeptide of claim 15 wherein the polypeptide comprises 
fewer than 50 amino acids. 

18. A purified polypeptide consisting of an amino acid sequence selected from 
the group consisting of SEQ ID NOs:l-235. . 

19. A purified polypeptide consisting essentially of an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1-235. 

20. A vector comprising the nucleic acid of claim 1. 

21. The vector of claim 20 wherein the vector comprises expression control 
sequences that direct the expression of the polypeptide. 

22. The vector of claim 20 wherein the vector comprises expression control 
sequences that direct expression of the nucleic acid molecule. 

23. Acellcomprisingthe vector of claim 20. 

24. An antibody that selectively binds a polypeptide consisting of an amino add 
sequence selected from the group consisting of SEQ ID NOs:l-235. 

25. A method of making an antibody, the method comprising: 

(a) providing a polypeptide comprising an amino acid sequence selected fix)m tiie 
group consisting of SEQ ID NOs: 1-235 to a mammal in an amount effective to induce the 
production of an antibody that binds to the polypeptide; 

(b) isolating firom die mammal a cell that produces an antibody that selectively 
binds to a polypeptide consisting of an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 1-235; 

(c) immortalizing die cell isolated in step (b); and 

(d) isolating antibodies from the immortalized cell. 
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26. The antibody of claim 24 wherein the polypeptide is expressed on a cell surface. 

27. The antibody of claim 24, wherein the polypeptide is a target of a second 
antibody located on a cell surface. 

28. A humanized antibody which specifically binds to a domain of a polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ E> NOs: 1- 
235. 

29. The humanized antibody of claim 28 which is a full length antibody. 

30. The humanized antibody of claim 28 which is a human IgG. 

31. The humanized antibody of claim 28 which is an antibody fragment. 

32. The humanized antibody of claim 28 wherein the antibody fragment is a 

F(ab72. 

33. A labeled antibody comprising the humanized antibody of claim 28 bound to 
a detectable label. 

34. An immobilized antibody comprising the humanized antibody of claim 28 
bound to a solid phase. 

35. A conjugate comprising the humanized antibody of claim 28 bound to a 
cytotoxic agent. 

36. A method for determining flie presence of a protein comprising exposing a 
sample suspected of containing tiie protein to the humanized antibody of claim 28 and 
determining binding of said antibody to the sample. 
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37. A kit comprising the humanized antibody of claim 28 and instructions for 
using the humanized antibody to detect a protein that binds to the antibody, 

5 38. An isolated nucleic acid encoding the humanized antibody of claim 28. 

39. A method for modulating the activity of the polypeptide of claim 10, the method 
comprising contacting the polypeptide with a compound that binds to the polypeptide in a 
concentration sufficient to modulate the activity of the polypeptide. 

10 

40, The method of claim 39, wherein the compound that binds the polypeptide is an 
antibody that selectively binds a polypeptide consisting of an amino acid sequence 
selected for the group consisting of SEQ ID NOs: 1-235, 

15 4 1 . A method of treating a disorder in a mammal, the method comprising: 

(a) identifying a mammal with the disorder, and 

(b) administering to the mammal a compound that modulates the expression or 
activity of the polypeptide of claim 10, 

wherein the administration results in an amelioration of one or more symptoms of 
20 the disorder. 

42. A method for detecting the presence of a polypeptide of claim 10 in a sample, 
the method comprising: 

contacting the sample with a compound that selectively binds to a polypeptide 
25 consisting of an amino add sequence selected firom the group consisting of SEQ K) NOs: 1- 
235; and 

determining whether the compound binds to the polypeptide in the sample. 

43. A method for detecting the presence of a disorder in a manraial, the method 
30 comprising: 

(a) providing a biological sample derived from the manuxial; 
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(b) contacting the sample with a compound that binds to the polypeptide of claim 
17 or to a nucleic acid that encodes the polypeptide of claim 17; and 

(c) determining whether the compound binds to the sample, 

wherein binding of the compound to the sample indicates the presence or absence of 
the disorder in the mammal. 

44. A method for imaging a site in a mammal, the method comprising: 

(a) administering a compound to a mammal, wherein the compound binds to the 
polypeptide of claim 10 or to a nucleic acid that encodes the polypeptide; and 

(b) detecting the compound with an imaging detector, thereby imaging the site in 
the manmial. 

45. A method for identifying a compound that modulates the activity of the 
polypeptide of claim 10, the method comprising: 

(a) contacting the polypeptide of claim 8 with a test compound; and 

(b) determining the effect of the test compound on the activity of the polypeptide, to 
thereby identify a compound that modulates the activity of the polypeptide. 

46. A method for identifying a compound that modulates the expression of the 
nucleic acid of claim 2, the method comprising: 

contacting the nucleic acid of claim 2 with a test compound; and 
determining the effect of the test compound on the expression of the nucleic acid, to 
thereby identify a compound that modulates the expression of the nucleic acid 

47. A polypeptide profile that is characteristic of a given cell, wherein the profile 
comprises a representation of at least ten.different polypeptides in the cell, wherein each of 
the at least ten different polypeptides comprises an amino acid sequence selected from the 
group consisting of SEQ ID NOs: 1-235, and wherein the polypeptide profile is a 
reproducible characteristic of the cell. 
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48. The polypeptide profile of claim 47, wherein the each of the at least ten different 
polypeptides comprises an MHC-binding polypeptide. 

49. The polypeptide profile of claim 47, wherein the representation characterizes 
5 each individual polypeptide based upon at least one physical or chemical attribute of the 

polypeptide, the at least one physical or chemical attribute comprising an amdno acid 
sequence. 

50. The polypeptide profile of claim 47, wherein the representation characterizes 
10 each individual peptide based upon at least two physical or chemical attributes. 

51. The polypeptide profile of claim 47, wherein one of the physical or chemical 
attributes is a nucleotide sequence encoding the amino acid sequence. 



15 52. The polypeptide profile of claim 47, wherein one of the physical or chemical 

attributes is mass-to-charge ratio. 

53. The polypeptide profile of claim 47, wherein one of the physical or chemical 
attributes is an ion-firagmentation pattern. 

20 

54. The polypeptide profile of claim 47, wherein the representation characterizes 
each individual peptide based upon at least three physical or chemical attributes. 

55. A database, stored on a machine-readable medium, comprising: 

25 (a) two categories of data respectively representing: (i) peptide profiles and (ii) cell 

sources; and 

(b) associations among instances of the two categories of data, 
wherein the data representing polypeptide profiles comprise the peptide profile of 
claim 46, and 

30 wherein the database configures a computer to enable finding instances of data of 

one of the categories based on their associations with instances of data the other category. 
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56. A method of selecting an antibody, the method comprising: 

• (a) contacting the polypeptide of claim 10 with an in vitro library of antibodies; 
(b) binding an antibody to the polypeptide; and 
5 (c) selecting the antibody that binds to the polypeptide. 

57. An inDutnunogenic composition comprising a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 1-235, the composition 
when injected into a manmial eliciting an immunogenic response directed against a 

10 polypeptide consisting of an amino acid sequence selected from the group consisting of 
SEQ ID NOs: 1-235. 



58. A method for treating a cancer comprising administering to a patient an 
amount of a composition comprising a polypeptide comprising an amino acid sequence 
15 selected from the group consisting of SEQ ID NOs: 1-235 in an amount sufficient to elicit 
an immunogenic response. 



59. A method for treating a cancer patient, the method comprising administering 
to the patient an antibody that selectively binds to a peptide consisting of an amino acid 

20 sequence selected from the group consisting of SEQ ID NOs: 1-235. 

60. A peptide array comprising at least 100 peptides selected from the group 
consisting of peptides consisting of an amino acid sequence selected from the group 
consisting of SEQ ID NOs:l-235, each peptide linked to a solid support at a known 

25 location. 



61. A collection of at least 10 polypeptide arrays, each array comprising at least 
100 polypeptides consisting of an amino acid sequence selected from the group consisting 
of SEQ ID NOs: 1-235, each peptide linked to a solid support at a known location. 

30 
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62. A method for identifying a compound that binds to a naturally processed class 
I MHC-binding polypeptide, the method comprising exposing a test compound to a 
collection of at least 100 polypeptides selected from the group consisting of polypeptides 
having an amino acid sequence selected from the group consisting of SEQ H) NOs: 1-235, 

5 and identifying a peptide to which the test compound binds. 

63. A method for identifying a compound that binds to a naturally processed class 
n MHC-binding polypeptide, the method comprising exposing a test compound to a 
collection of at least 100 polypeptides selected from the group consisting of polypeptides 

10 having an amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235 
and identifying a peptide to which the test compound binds. 

64. A database, stored on a machine-readable medium, comprising: 

(a) three categories of data respectively representing (i) polypeptides, (ii) cell 
15 sources, and (iii) cell treatments; and 

(b) associations among instances of the three categories of data, 

wherein the data representing peptides comprises at least 100 polypeptides each 
having an amino acid sequence selected from the group consisting of SEQ ID NOs: 1-235, 
and 

20 wherein the database configures a computer to enable finding instances of data of 

one of the categories based on their associations with instances of data of at least one 
other category. 

65. A polypeptide profile that is characteristic of a selected cell under selected 
25 conditions, wherein the profile comprises a representation of at least ten different 

polypeptides expressed by the cell, wherein each of the at least ten different polypeptides 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NOs: 1-235, and wherein the polypeptide profile is a reproducible characteristic of the 
ceU. 

30 



133 



